VASCULAR ENDOTHELIAL GROWTH FACTOR-X

The present invention is concerned with a novel
vascular endothelial growth factor (VEGF), herein
designated "VEGF-X", and characterisation of the
nucleic acid and amino acid sequences of VEGF-X.

Introduction 

Angiogenesis involves formation and proliferation of
new blood vessels, and is an essential physiological
process for normal growth and development of tissues
in, for example, embryonic development, tissue
regeneration and organ and tissue repair.
Angiogenesis also features in the growth of human
cancers, which require continuous stimulation of blood
vessel growth. Abnormal angiogenesis is associated
with other diseases such as rheumatoid arthritis,
psoriasis and diabetic retinopathy.

Capillary vessels consist of endothelial cells which
carry the genetic information necessary to proliferate
to form capillary networks. Angiogenic molecules
which can initiate this process have previously been
characterised. A highly selective mitogen for
vascular endothelial cells is vascular endothelial
growth factor (VEGF) (Ferrara et al., "Vascular
Endothelial Growth Factor: Basic Biology and Clinical
Implications", Regulation of Angiogenesis, ed. by I.D.
Goldberg and E.M. Rosen, 1997, Birkhauser Verlag,
Basle, Switzerland). VEGF is a potent vasoactive
protein which is comprised of a glycosylated cationic
46-49 kd dimer having two 24 kd subunits. It is
inactivated by sulfhydryl reducing agents, is
resistant to acidic pH and to heating, and binds to
immobilised heparin.

VEGF-A has four different forms of 121, 165, 189 and
206 amino acids respectively due to alternative
splicing. VEGF121 and VEGF165 are soluble and are
capable of promoting angiogenesis, whereas VEGF189 and
VEGF206 are bound to heparin-containing proteoglycans
in the cell surface. The temporal and spatial
expression of VEGF has been correlated with
physiological proliferation of the blood vessels
(Gajdusek, C.M., and Carbon, S.J., Cell Physiol.,
139:570-579 (1989); McNeil, P.L., Muthukrishnan, L.,
Warder, E., D'Amore, P.A., J. Cell. Biol.,
109:811-822 (1989)). Its high affinity binding sites are
localized only on endothelial cells in tissue sections
(Jakeman, L.B., et al., Clin. Invest. 89:244-253
(1989)). The growth factor can be isolated from
pituitary cells and several tumor cell lines, and has
been implicated in some human gliomas (Plate, K.H.,
Nature 359:845-848 (1992)). The inhibition of VEGF
function by anti-VEGF monoclonal antibodies was shown
to inhibit tumor growth in immune-deficient mice (Kim,
K.J., Nature 362:841-844 (1993)).

VEGF proteins have been described in the following
patents and applications, all of which are hereby
incorporated by reference: EP-0,506,477, WO-95/24473,
WO-98/28621, WO-90/13649, EP-0,476,983, EP-0,550,296,
WO-90/13649, WO-96/26736, WO-96/27007, WO-98/49300,
WO-98/36075, WO-98/840124, WO-90/11084, WO-98/24811,
WO-98/10071, WO-98/07832, WO-98/02543, WO-97/05250,
WO-91/02058, WO-96/39421, WO-96/39515, WO-98/16551.

The present inventors have now identified a further
vascular endothelial growth factor, designated herein
as "VEGF-X", and the nucleic acid sequence encoding
it, which has potentially significant benefits for the
treatment of tumours and other conditions mediated by
inappropriate angiogenic activity.

Summary of the Invention



In the present application, there is provided a novel
vascular endothelial growth factor, herein designated
"VEGF-X", nucleic acid molecules encoding said growth
factor, an expression vector comprising said nucleic
acid molecule, a host cell transformed with said
vector and compounds which inhibit or enhance
angiogenesis. Also provided is the sequence of a CUB
domain present in the sequence of VEGF-X, which domain
itself prevents angiogenesis and which is used to
treat diseases associated with inappropriate
vascularisation or angiogenesis.






Detailed Description of the Invention

Therefore, according to a first aspect of the present 
invention there is provided a nucleic acid molecule 
encoding a VEGF-X protein or a functional equivalent, 
fragment, derivative or bioprecursor thereof, said 
protein comprising the amino acid sequence from 
position 23 to 345 of the amino acid sequence 
illustrated in Figure 10. Alternatively, the nucleic 
acid molecule of the invention encodes the complete 
sequence identified in Figure 10 and which 
advantageously includes a signal peptide to express 
said protein extracellularly. Preferably, the nucleic 
acid molecule is a DNA and even more preferably a cDNA 
molecule. Preferably, the nucleic acid molecule 
comprises the nucleotide sequence from position 257 to 
1291 of the nucleotide sequence illustrated in Figure 
9. In a preferred embodiment the nucleic acid is of 
mammalian origin and even more preferably of human 
origin. 
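
The coordinates recited above can be checked with simple arithmetic. The following is a minimal sketch in Python, assuming that the positions of Figures 9 and 10 are numbered from 1 and that both end points are included; the helper names and the placeholder sequence are hypothetical and are not taken from the figures.

```python
def span_length(start: int, end: int) -> int:
    """Number of positions in a 1-based, inclusive range."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def subsequence(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    span_length(start, end)  # validates the range
    if end > len(seq):
        raise ValueError("end lies beyond the sequence")
    return seq[start - 1:end]


# Nucleotides 257 to 1291 (Figure 9) and residues 23 to 345 (Figure 10).
coding_nt = span_length(257, 1291)
mature_aa = span_length(23, 345)

# Placeholder standing in for a 345-residue sequence (NOT the real sequence).
placeholder = "M" + "A" * 344
mature = subsequence(placeholder, 23, 345)
```

Under these assumptions the nucleotide range spans 1035 positions, which is 345 codons and so matches the 345 positions of the complete amino acid sequence, while positions 23 to 345 span 323 residues.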



In accordance with the present invention, a functional
equivalent should be taken to mean a protein, or a
sequence of amino acids, that has similar function to
the VEGF-X protein of the invention.

Also provided by this aspect of the present invention
is a nucleic acid molecule such as an antisense
molecule capable of hybridising to the nucleic acid
molecules according to the invention under high
stringency conditions, which conditions would be well
known to those skilled in the art.

Stringency of hybridisation as used herein refers to
conditions under which polynucleic acids are stable.
The stability of hybrids is reflected in the melting
temperature (Tm) of the hybrids. Tm can be
approximated by the formula:

81.5°C + 16.6 (log10 [Na+]) + 0.41 (%G&C) - 600/l

wherein l is the length of the hybrids in nucleotides.
Tm decreases approximately by 1-1.5°C with every 1%
decrease in sequence homology.
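
By way of illustration only, the approximation above can be evaluated numerically. The sketch below assumes that [Na+] is given in mol/l, that %G&C is a percentage between 0 and 100 and that l is the hybrid length in nucleotides; the function names are hypothetical.

```python
import math


def approx_tm(na_molar: float, gc_percent: float, length_nt: int) -> float:
    """Tm in degrees C: 81.5 + 16.6*log10[Na+] + 0.41*(%G&C) - 600/l."""
    if na_molar <= 0 or length_nt <= 0:
        raise ValueError("concentration and length must be positive")
    if not 0.0 <= gc_percent <= 100.0:
        raise ValueError("%G&C must lie between 0 and 100")
    return (81.5 + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent - 600.0 / length_nt)


def tm_range_after_mismatch(tm: float, percent_decrease: float) -> tuple:
    """Apply the stated 1-1.5 degree C drop per 1% decrease in homology.

    Returns (lowest, highest) estimate of the reduced Tm.
    """
    if percent_decrease < 0:
        raise ValueError("percent_decrease must not be negative")
    return (tm - 1.5 * percent_decrease, tm - 1.0 * percent_decrease)


# Illustrative inputs only: 1 M Na+, 50% G&C, 600-nucleotide hybrid.
tm_perfect = approx_tm(1.0, 50.0, 600)
tm_low, tm_high = tm_range_after_mismatch(tm_perfect, 10.0)
```

With these illustrative inputs the salt term is zero, so the estimate is governed by the G&C content and the length correction only.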

The term "stringency" refers to the hybridisation
conditions wherein a single-stranded nucleic acid
joins with a complementary strand when the purine or
pyrimidine bases therein pair with their corresponding
base by hydrogen bonding. High stringency conditions
favour homologous base pairing whereas low stringency
conditions favour non-homologous base pairing.

"Low stringency" conditions comprise, for example, a
temperature of about 37°C or less, a formamide
concentration of less than about 50%, and a moderate
to low salt (SSC) concentration; or, alternatively, a
temperature of about 50°C or less, and a moderate to
high salt (SSPE) concentration, for example 1M NaCl.

"High stringency" conditions comprise, for example, a
temperature of about 42°C or less, a formamide
concentration of less than about 20%, and a low salt
(SSC) concentration; or, alternatively, a temperature
of about 65°C or less, and a low salt (SSPE)
concentration. For example, high stringency
conditions comprise hybridization in 0.5 M NaHPO4, 7%
sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C
(Ausubel, F.M. et al., Current Protocols in Molecular
Biology, Vol. I, 1989; Green Inc., New York, at
2.10.3).

"SSC" comprises a hybridization and wash solution. A 
stock 20X SSC solution contains 3M sodium chloride,
0.3M sodium citrate, pH 7.0.
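
Working-strength SSC follows from the 20X stock by simple dilution. The sketch below is illustrative only and assumes linear dilution with water; the function names and the example volumes are hypothetical.

```python
STOCK_FOLD = 20.0
# Molar concentrations in the 20X stock, as stated above.
STOCK_MOLAR = {"sodium chloride": 3.0, "sodium citrate": 0.3}


def ssc_molarity(fold: float) -> dict:
    """Component molarities of an SSC solution at the given fold (e.g. 2 for 2X)."""
    if fold <= 0 or fold > STOCK_FOLD:
        raise ValueError("fold must be in the range (0, 20]")
    return {name: molar * fold / STOCK_FOLD
            for name, molar in STOCK_MOLAR.items()}


def stock_volume_ml(fold: float, final_volume_ml: float) -> float:
    """Volume of 20X stock needed to prepare final_volume_ml at the given fold."""
    if final_volume_ml <= 0:
        raise ValueError("final volume must be positive")
    ssc_molarity(fold)  # validates the fold
    return final_volume_ml * fold / STOCK_FOLD


one_x = ssc_molarity(1.0)
stock_for_2x = stock_volume_ml(2.0, 500.0)
```

On this basis 1X SSC corresponds to about 0.15 M sodium chloride and 0.015 M sodium citrate.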

"SSPE" comprises a hybridization and wash solution. A 
1X SSPE solution contains 180 mM NaCl, 9 mM Na2HPO4 and
1 mM EDTA, pH 7.4.

The nucleic acid capable of hybridising to nucleic
acid molecules according to the invention will
generally be at least 70%, preferably at least 80 or
90% and more preferably at least 95% homologous to the
nucleotide sequences according to the invention.
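
As a rough illustration of the percentage thresholds above, the sketch below scores two sequences that have already been aligned to equal length. It assumes, for the purposes of the example only, that homology is measured as percent identity over the aligned, non-gap positions; the function names and test sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned positions; '-' marks a gap and is skipped."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and of equal aligned length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def highest_threshold_met(percent: float):
    """Return the highest of the 95/90/80/70% thresholds met, or None."""
    for threshold in (95, 90, 80, 70):
        if percent >= threshold:
            return threshold
    return None


identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
tier = highest_threshold_met(identity)
```

Other measures of homology, for example ones that score conservative substitutions as similar, would give different figures.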

The antisense molecule capable of hybridising to the
nucleic acid according to the invention may be used as
a probe or as a medicament or may be included in a
pharmaceutical composition with a pharmaceutically
acceptable carrier, diluent or excipient therefor.

The term "homologous" describes the relationship
between different nucleic acid molecules or amino acid
sequences wherein said sequences or molecules are
related by partial identity or similarity at one or
more blocks or regions within said molecules or
sequences.



The present invention also comprises within its scope
proteins or polypeptides encoded by the nucleic acid
molecules according to the invention or a functional
equivalent, derivative or bioprecursor thereof.

Therefore, according to a further aspect of the
present invention, there is provided a VEGF-X protein,
or a functional equivalent, derivative or bioprecursor
thereof, comprising an amino acid sequence from
position 23 to 345 of the sequence as illustrated in
Figure 10, or alternatively which amino acid sequence
comprises the complete sequence of Figure 10. A
further aspect of the invention comprises a VEGF-X
protein, or a functional equivalent, derivative or
bioprecursor thereof, encoded by a nucleic acid
molecule according to the invention. Preferably, the
VEGF-X protein encoded by said nucleic acid molecule
comprises the sequence from position 23 to 345 of the
amino acid sequence as illustrated in Figure 10, or
which sequence alternatively comprises the sequence of
amino acids of Figure 10.

The DNA molecules according to the invention may,
advantageously, be included in a suitable expression
vector to express VEGF-X encoded therefrom in a
suitable host. Incorporation of cloned DNA into a
suitable expression vector for subsequent
transformation of said cell and subsequent selection
of the transformed cells is well known to those
skilled in the art as provided in Sambrook et al.
(1989), Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory Press.



An expression vector according to the invention
includes a vector having a nucleic acid according to
the invention operably linked to regulatory sequences,
such as promoter regions, that are capable of
effecting expression of said DNA fragments. The term
"operably linked" refers to a juxtaposition wherein
the components described are in a relationship
permitting them to function in their intended manner.
Such vectors may be transformed into a suitable host
cell to provide for expression of a polypeptide
according to the invention. Thus, in a further
aspect, the invention provides a process for preparing
polypeptides according to the invention which
comprises cultivating a host cell, transformed or
transfected with an expression vector as described
above, under conditions to provide for expression by
the vector of a coding sequence encoding the
polypeptides, and recovering the expressed
polypeptides.

The vectors may be, for example, plasmid, virus or
phage vectors provided with an origin of replication,
and optionally a promoter for the expression of said
nucleotide and optionally a regulator of the promoter.

The vectors may contain one or more selectable
markers, such as, for example, ampicillin resistance.

Regulatory elements required for expression include
promoter sequences to bind RNA polymerase and
transcription initiation sequences for ribosome
binding. For example, a bacterial expression vector
may include a promoter such as the lac promoter and
for translation initiation the Shine-Dalgarno sequence
and the start codon AUG. Similarly, a eukaryotic
expression vector may include a heterologous or
homologous promoter for RNA polymerase II, a
downstream polyadenylation signal, the start codon
AUG, and a termination codon for detachment of the
ribosome. Such vectors may be obtained commercially
or assembled from the sequences described by methods
well known in the art.

Nucleic acid molecules according to the invention may
be inserted into the vectors described in an antisense
orientation in order to provide for the production of
antisense RNA. Antisense RNA or other antisense
nucleic acids may be produced by synthetic means.

In accordance with the present invention, a defined
nucleic acid includes not only the identical nucleic
acid but also any minor base variations including in
particular substitutions in cases which result in a
synonymous codon (a different codon specifying the
same amino acid residue) due to the degenerate code in
conservative amino acid substitutions. The term
"nucleic acid sequence" also includes the
complementary sequence to any single stranded sequence
given regarding base variations.
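
The notion of a synonymous codon can be illustrated with the standard genetic code. The sketch below is illustrative only: it assumes the standard code, written with the bases in the order T, C, A, G, and uses short made-up sequences rather than any sequence of the invention; the function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; codons enumerated TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_synonymous(codon_a: str, codon_b: str) -> bool:
    """True if two different codons specify the same amino acid residue."""
    a, b = codon_a.upper(), codon_b.upper()
    return a != b and CODON_TABLE[a] == CODON_TABLE[b]


original = translate("ATGGCT")   # made-up two-codon example
variant = translate("ATGGCC")    # third base of the second codon changed
```

Here the two made-up sequences differ by one base yet encode the same two residues, which is the kind of minor base variation referred to above.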

The present invention also advantageously provides
nucleic acid sequences of at least approximately 10
contiguous nucleotides of a nucleic acid according to
the invention and preferably from 10 to 50 nucleotides.
Even more preferably, the nucleic acid sequences
comprise the sequences illustrated in Figure 3. These
sequences may, advantageously, be used as probes or
primers to initiate replication, or the like. Such
nucleic acid sequences may be produced according to
techniques well known in the art, such as by
recombinant or synthetic means. They may also be used
in diagnostic kits or the like for detecting the
presence of a nucleic acid according to the invention.
These tests generally comprise contacting the probe
with the sample under hybridising conditions and
detecting for the presence of any duplex or triplex
formation between the probe and any nucleic acid in
the sample.






The nucleic acid sequences according to this aspect of 
the present invention comprise the sequences of 
nucleotides illustrated in Figures 3 and 5. 

According to the present invention these probes may be
anchored to a solid support. Preferably, they are
present on an array so that multiple probes can
simultaneously hybridize to a single biological
sample. The probes can be spotted onto the array or
synthesised in situ on the array. (See Lockhart et
al., Nature Biotechnology, vol. 14, December 1996,
"Expression monitoring by hybridisation to high
density oligonucleotide arrays".) A single array can
contain more than 100, 500 or even 1,000 different
probes in discrete locations.

The nucleic acid sequences according to the invention
may be produced using such recombinant or synthetic
means, such as for example using PCR cloning
mechanisms which generally involve making a pair of
primers, which may be from approximately 10 to 50
nucleotides, to a region of the gene which is desired
to be cloned, bringing the primers into contact with
mRNA, cDNA, or genomic DNA from a human cell,
performing a polymerase chain reaction under
conditions which bring about amplification of the
desired region, isolating the amplified region or
fragment and recovering the amplified DNA. Generally,
such techniques are well known in the art, such as
described in Sambrook et al. (Molecular Cloning: a
Laboratory Manual, 1989).

The nucleic acids or oligonucleotides according to the
invention may carry a revealing label. Suitable
labels include radioisotopes such as 32P or 35S, enzyme
labels or other protein labels such as biotin or
fluorescent markers. Such labels may be added to the
nucleic acids or oligonucleotides of the invention and
may be detected using techniques known per se.

Advantageously, human allelic variants or
polymorphisms of the DNA molecule according to the
invention may be identified by, for example, probing
cDNA or genomic libraries from a range of individuals,
for example, from different populations. Furthermore,
nucleic acids and probes according to the invention
may be used to sequence genomic DNA from patients
using techniques well known in the art, such as the
Sanger dideoxy chain termination method, which may,
advantageously, ascertain any predisposition of a
patient to certain disorders associated with a growth
factor according to the invention.

The protein according to the invention includes all
possible amino acid variants encoded by the nucleic
acid molecule according to the invention including a
polypeptide encoded by said molecule and having
conservative amino acid changes. Conservative amino
acid substitution refers to a replacement of one or
more amino acids in a protein as identified in Table
1. Proteins or polypeptides according to the invention
further include variants of such sequences, including
naturally occurring allelic variants which are
substantially homologous to said proteins or
polypeptides. In this context, substantial homology
is regarded as a sequence which has at least 70%,
preferably 80 or 90% and more preferably 95% amino acid
homology with the proteins or polypeptides encoded by
the nucleic acid molecules according to the invention.
The protein according to the invention may be
recombinant, synthetic or naturally occurring, but is
preferably recombinant.

The nucleic acid or protein according to the invention 
may be used as a medicament or in the preparation of a 
medicament for treating cancer or other diseases or 
conditions associated with expression of VEGF-X 
protein. 

Advantageously, the nucleic acid molecule or the 
protein according to the invention may be provided in 
a pharmaceutical composition together with a 
pharmacologically acceptable carrier, diluent or 
excipient therefor. 

The present invention is further directed to
inhibiting VEGF-X in vivo by the use of antisense
technology. Antisense technology can be used to
control gene expression through triple-helix formation
or antisense DNA or RNA, both of which methods are
based on binding of a polynucleotide to DNA or RNA.

For example, the 5' coding portion or the mature DNA
sequence, which encodes for the protein of the present
invention, is used to design an antisense RNA
oligonucleotide of from 10 to 50 base pairs in length.
A DNA oligonucleotide is designed to be complementary
to a region of the gene involved in transcription
(triple-helix - see Lee et al., Nucl. Acids Res.,
6:3073 (1979); Cooney et al., Science, 241:456 (1988);
and Dervan et al., Science, 251:1360 (1991)), thereby
preventing transcription and the production of VEGF-X.
The antisense RNA oligonucleotide hybridises to the
mRNA in vivo and blocks translation of an mRNA
molecule into the VEGF-X protein (antisense - Okano,
J. Neurochem., 56:560 (1991); Oligodeoxynucleotides as
Antisense Inhibitors of Gene Expression, CRC Press,
Boca Raton, FL (1988)).
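
The design of an antisense RNA oligonucleotide against a chosen stretch of mRNA amounts to taking the reverse complement of that stretch. The following is a minimal sketch under stated assumptions: the target is supplied 5' to 3', the length window of 10 to 50 is the one recited above, and the example target is a made-up sequence, not part of the VEGF-X sequence; the function name is hypothetical.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna_target: str, min_len: int = 10, max_len: int = 50) -> str:
    """Return the antisense RNA (5'->3') for an mRNA target given 5'->3'.

    DNA-style input is accepted: any T is read as U.
    """
    target = mrna_target.upper().replace("T", "U")
    if not min_len <= len(target) <= max_len:
        raise ValueError("target length outside the 10 to 50 window")
    if set(target) - set("ACGU"):
        raise ValueError("target contains characters other than A, C, G, U/T")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return target.translate(RNA_COMPLEMENT)[::-1]


example_target = "AUGGCUAGCUUAGGC"      # hypothetical 15-base stretch
example_oligo = antisense_rna(example_target)
```

Applying the function twice returns the original target, which is a quick check that the oligonucleotide is fully complementary to it.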



Alternatively, the oligonucleotide described above can
be delivered to cells by procedures in the art such
that the anti-sense RNA and DNA may be expressed in
vivo to inhibit production of VEGF-X in the manner
described above.

Antisense constructs to VEGF-X, therefore, may inhibit
the angiogenic activity of VEGF-X and prevent the
further growth of or even regress solid tumours, since
angiogenesis and neovascularization are essential
steps in solid tumour growth. These antisense
constructs may also be used to treat rheumatoid
arthritis, psoriasis and diabetic retinopathy which
are all characterized by abnormal angiogenesis.

A further aspect of the invention provides a host cell
or organism, transformed or transfected with an
expression vector according to the invention. The
host cell or organism may advantageously be used in a
method of producing VEGF-X, which comprises recovering
any expressed VEGF-X from the host or organism
transformed or transfected with the expression vector.

According to a further aspect of the invention there
is also provided a transgenic cell, tissue or organism
comprising a transgene capable of expressing VEGF-X
protein according to the invention. The term
"transgene capable of expression" as used herein means
a suitable nucleic acid sequence which leads to
expression of VEGF-X or proteins having the same
function and/or activity. The transgene may include,
for example, genomic nucleic acid isolated from human
cells or synthetic nucleic acid, including DNA
integrated into the genome or in an extrachromosomal
state. Preferably, the transgene comprises the
nucleic acid sequence encoding the proteins according
to the invention as described herein, or a functional
fragment of said nucleic acid. A functional fragment
of said nucleic acid should be taken to mean a
fragment of the gene comprising said nucleic acid
coding for the proteins according to the invention or
a functional equivalent, derivative or a
non-functional derivative such as a dominant negative
mutant, or bioprecursor of said proteins. For
example, it would be readily apparent to persons
skilled in the art that nucleotide substitutions or
deletions may be made using routine techniques, which
do not affect the protein sequence encoded by said
nucleic acid, or which encode a functional protein
according to the invention.

VEGF-X protein expressed by said transgenic cell, 
tissue or organism or a functional equivalent or 
bioprecursor of said protein also forms part of the 
present invention. 



Antibodies to the protein or polypeptide of the
present invention may, advantageously, be prepared by
techniques which are known in the art. For example,
polyclonal antibodies may be prepared by inoculating a
host animal, such as a mouse or rabbit, with the
polypeptide according to the invention or an epitope
thereof and recovering immune serum. Monoclonal
antibodies may be prepared according to known
techniques such as described by Kohler R. and Milstein
C., Nature (1975) 256, 495-497. Advantageously, such
antibodies may be included in a kit for identifying
VEGF-X in a sample, together with means for contacting
the antibody with the sample.

Advantageously, the antibody according to the 
invention may also be used as a medicament or in the 
preparation of a medicament for treating tumours or 
other diseases associated with expression of VEGF-X. 
The invention also further provides a pharmaceutical
composition comprising said antibody together with a
pharmaceutically acceptable carrier, diluent or
excipient therefor.

Proteins which interact with the polypeptide of the
invention may be identified by investigating protein
interactions using the two-hybrid vector system first
proposed by Chien et al., (1991) Proc. Natl. Acad.
Sci. USA 88:9578-9582.

This technique is based on functional reconstitution
in vivo of a transcription factor which activates a
reporter gene. More particularly the technique
comprises providing an appropriate host cell with a
DNA construct comprising a reporter gene under the
control of a promoter regulated by a transcription
factor having a DNA binding domain and an activating
domain, expressing in the host cell a first hybrid DNA
sequence encoding a first fusion of a fragment or all
of a nucleic acid sequence according to the invention
and either said DNA binding domain or said activating
domain of the transcription factor, expressing in the
host at least one second hybrid DNA sequence, such as
a library or the like, encoding putative binding
proteins to be investigated together with the DNA
binding or activating domain of the transcription
factor which is not incorporated in the first fusion;
detecting any binding of the proteins to be
investigated with a protein according to the invention
by detecting for the presence of any reporter gene
product in the host cell; and optionally isolating second
hybrid DNA sequences encoding the binding protein.

An example of such a technique utilises the GAL4
protein in yeast. GAL4 is a transcriptional activator
of galactose metabolism in yeast and has a separate
domain for binding to activators upstream of the
galactose metabolising genes as well as a protein
binding domain. Nucleotide vectors may be
constructed, one of which comprises the nucleotide
residues encoding the DNA binding domain of GAL4.
These binding domain residues may be fused to a known
protein encoding sequence, such as for example, the
nucleic acids according to the invention. The other
vector comprises the residues encoding the protein
binding domain of GAL4. These residues are fused to
residues encoding a test protein. Any interaction
between polypeptides encoded by the nucleic acid
according to the invention and the protein to be
tested leads to transcriptional activation of a
reporter molecule in a GAL4 transcription deficient
yeast cell into which the vectors have been
transformed. Preferably, a reporter molecule such as
β-galactosidase is activated upon restoration of
transcription of the yeast galactose metabolism genes.

A further aspect of the present invention also 
provides a method of identifying VEGF-X in a sample, 
which method comprises contacting said sample with an 
antibody according to the invention and monitoring for 
any binding of any proteins to said antibody. A kit 
for identifying the presence of VEGF-X in a sample is 
also provided comprising an antibody according to the 
invention and means for contacting said antibody with 
said sample. 

VEGF-X may be recovered and purified from recombinant 
cell cultures by methods known in the art, including 
ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, 
hydroxyapatite chromatography and lectin 
chromatography.

The VEGF-X protein of the present invention may be a
naturally purified product, or a product of chemical
synthetic procedures, or produced by recombinant
techniques from a prokaryotic or eukaryotic host (for
example, by bacterial, yeast, higher plant, insect and
mammalian cells in culture). Depending upon the host
employed in a recombinant production procedure, the
polypeptides of the present invention may be
glycosylated with mammalian or other eukaryotic
carbohydrates or may be non-glycosylated.

VEGF-X is particularly advantageous as a wound healing
agent, where, for example, it is necessary to
re-vascularize damaged tissues, or where new capillary
angiogenesis is important. Accordingly, VEGF-X may be
used for treatment of various types of wounds such as,
for example, dermal ulcers, including pressure sores,
venous ulcers, and diabetic ulcers. In addition, it
can be used in the treatment of full-thickness burns
and injuries where angiogenesis is desired to prepare
the burn in injured sites for a skin graft and flap.
In this case, VEGF-X or the nucleic acid encoding it
may be applied directly to the wound. VEGF-X may be
used in plastic surgery when reconstruction is
required following a burn, other trauma, or even for
cosmetic purposes.

An important application of VEGF-X is to induce the
growth of damaged bone, periodontium or ligament
tissue. For example, it may be used in periodontal
disease where VEGF-X is applied to the roots of the
diseased teeth, leading to the formation of new bone
and cementum with collagen fibre ingrowths. It can be
used for regenerating supporting tissues of teeth,
including alveolar bone, cementum and periodontal
ligament, that have been damaged by disease and
trauma.

Since angiogenesis is important in keeping wounds
clean and non-infected, VEGF-X may be used in
association with surgery and following the repair of
cuts. It should be particularly useful in the
treatment of abdominal wounds where there is a high
risk of infection.

VEGF-X can also be used for the promotion of
endothelialization in vascular graft surgery. In the
case of vascular grafts using either transplanted or
synthetic material, VEGF-X may be applied to the
surface of the graft or at the junction to promote the
growth of the vascular endothelial cells. One
derivation of this is that VEGF-X can be used to
repair the damage of myocardial infarction and other
occasions where coronary bypass surgery is needed by
stimulating the growth of the transplanted tissue.
Related to this is the use of VEGF-X to repair the
cardiac vascular system after ischemia.

The protein of the present invention may also be
employed in accordance with the present invention by
expression of such protein in vivo, which is often
referred to as "gene therapy".

Thus, for example, cells such as bone marrow cells may
be engineered with a polynucleotide (DNA or RNA)
encoding for the protein ex vivo as defined herein;
the engineered cells are then provided to a patient to
be treated with the polypeptide. Such methods are
well-known in the art. For example, cells may be
engineered by procedures known in the art by use of a
retroviral particle containing RNA encoding for the
protein of the present invention.



WO 00/37641 






PCT/US99/30503 



Similarly, cells may be engineered in vivo for 
expression of the protein in vivo, for example, by 
procedures known in the art. 

A further aspect of the invention comprises a method
of treating a disorder mediated by expression of a
protein according to the invention, by administering
to a patient an amount of an antisense molecule as
described herein, in sufficient concentration to
alleviate or reduce the symptoms of said disorder.

Compounds which inhibit or enhance angiogenesis may be
identified by providing a host cell or organism
according to the invention or a transgenic cell,
tissue or organism according to the invention,
contacting a test compound with said cell, tissue or
organism and monitoring for the effect of said
compound compared to a cell, tissue or organism which
has not been contacted with said compound. These
compounds may themselves be used as a medicament or
included in a pharmaceutical composition for treatment
of disorders mediated by inappropriate vascularisation
or angiogenic activity.

The present inventors have also, advantageously,
identified in the sequence encoding the VEGF-X protein
a CUB domain, which has not previously been
identified in VEGF-type growth factors. The VEGF-X
protein may therefore exert dual regulatory effects
via interaction with the VEGF tyrosine kinase
receptors or with neuropilin receptors mediated by the
CUB domain. Thus, the sequence encoding said CUB
domain may be included in an expression vector for
subsequent transformation of a host cell, tissue or
organism.

VEGF-X or fragments thereof may be able to modulate
the effects of pro-angiogenic growth factors such as
VEGF, as indicated by the findings presented in the
examples below that the N-terminal part of the VEGF-X
protein, a CUB-like domain, is able to inhibit
VEGF-stimulated proliferation of HUVECs. VEGF-X or
fragments thereof may therefore be useful in therapy
of conditions involving inappropriate angiogenesis.
Inhibition of the angiogenic activity of VEGF has
been linked with inhibition of tumour growth in
several models, e.g. Kim K.J. et al., Nature
362:841-844 (1993). Additionally, agents able to inhibit
angiogenesis would be expected to be useful in
treating other angiogenesis-dependent diseases such as
retinopathy, osteoarthritis and psoriasis (Folkman,
J., Nature Medicine 1:27-31 (1995)).

As identified in more detail in the Examples
described herein, the present inventors have
surprisingly found that the CUB domain of VEGF-X
is able to inhibit stimulation of proliferation of
HUVECs induced by either VEGF or bFGF. The CUB domain
may, therefore, be utilised as a therapeutic agent
for inhibition of angiogenesis and for treatment of
conditions associated with inappropriate
vascularisation or angiogenesis.

Therefore according to a further aspect of the
invention there is provided a method of inhibiting
angiogenic activity and inappropriate vascularisation,
including formation and proliferation of new blood
vessels, growth and development of tissues, tissue
regeneration and organ and tissue repair, in a subject,
said method comprising administering to said subject
an amount of a polypeptide having an amino acid
sequence from position 40 to 150 of the sequence
illustrated in Figure 10, or a nucleic acid molecule
encoding the CUB domain according to the invention, in
sufficient concentration to reduce or prevent said
angiogenic activity.

Furthermore there is also provided a method of
treating or preventing any of cancer, rheumatoid
arthritis, psoriasis and diabetic retinopathy in a
subject, said method comprising administering to said
subject an amount of a polypeptide having an amino acid
sequence from position 40 to 150 of the sequence
illustrated in Figure 10, or a nucleic acid molecule
encoding the CUB domain according to the invention, in
sufficient concentration to treat or prevent said
disorders.

The CUB domain may also be used to identify compounds
that inhibit or enhance angiogenic activity such as
inappropriate vascularisation, in a method comprising
contacting a cell expressing a VEGF receptor and/or a
neuropilin 1 or 2 type receptor with said compound in
the presence of a VEGF-X protein according to the
invention and monitoring for the effect of said
compound on said cell when compared to a cell which
has not been contacted with said compound. Such
compounds may then be used as appropriate to prevent
or inhibit angiogenic activity to treat the disorders
or conditions described herein, or in a
pharmaceutical composition. An antibody to said CUB
domain may also be useful in identifying other
proteins having said sequences.



Deposited Plasmids

                                 Date of Deposit    Accession No.

Plasmid VEGFX/pCR2.1
1TOPO FL                         1 March 1999       LMBP 3925

Plasmid VEGFX/pRSETB BD
amino acids G230-G345            1 March 1999       LMBP 3926

Plasmid VEGFX/pCR2.1
FL Clone 9                       20 October 1999    LMBP 3977

Plasmid VEGF-X CUB
pET22b                           20 December 1999

The above plasmids were deposited at the Belgian
Coordinated Collections of Microorganisms (BCCM) at
Laboratorium Voor Moleculaire Biologie-
Plasmidencollectie (LMBP), B-9000 Ghent, Belgium, in
accordance with the provisions of the Budapest Treaty
of 28 April 1977.

The invention may be more clearly understood from the
following example, which is purely exemplary, with
reference to the accompanying drawings, wherein:

Figure 1: is a DNA sequence identified in the
Incyte LifeSeq™ database coding for a
novel VEGF-X protein.

Figure 2: is an illustration of the amino acid
sequence of the nucleic acid sequence
of Figure 1.

Figure 3: is an illustration of PCR primer
sequences utilised to identify the
VEGF-X protein according to the
invention.

Figure 4: is a diagrammatic illustration of the
spatial relationships in the VEGF-X
sequence of the clones identified
using the PCR primer sequences of
Figure 3.

Figure 5: is an illustration of the nucleotide
sequences of the 5' RACE primers used
to identify the 5' end of the VEGF-X
open reading frame.

Figure 6: is an illustration of the sequence
obtained from the RACE experiment.

Figure 7: is an illustration of the nucleotide
sequences obtained from the search of
the LifeSeq™ database using the sequence
in Figure 6.

Figure 8: is an illustration of the primers used
to clone the entire coding sequence of
VEGF-X.

Figure 9: is an illustration of the entire
coding sequence of VEGF-X.

Figure 10: is an illustration of the predicted
amino acid sequence of the nucleotide
sequence of Figure 9.

Figure 11: is an alignment of the sequence of
Figure 10 with the sequences of VEGF-A
to D.

Figure 12: is an illustration of variant
sequences of the VEGF-X protein
according to the invention.

Figure 13: is an illustration of the
oligonucleotide primers used for
E. coli expression of VEGF-X domains
and for expression of the full length
sequence of VEGF-X in a
baculovirus/insect cell expression
system.

Figure 14: depicts nucleic acid sequences of 18
human EST clones obtained from a BLAST
search of the LifeSeq™ database used
to identify the full sequence encoding
VEGF-X.

Figure 15: depicts the nucleotide sequences of 50
human EST clones obtained from the
LifeSeq™ database.

Figure 16: is an illustration of nucleotide
sequences utilised as primers to
identify the nucleotide sequence
encoding VEGF-X.

Figure 17: is a nucleotide sequence coding for a
partial VEGF-X protein according to
the invention.

Figure 18: is an illustration of a partial
nucleotide sequence encoding VEGF-X
protein according to the invention.



WO 00/37641 






PCT/US99/30503 



Figure 19: is an illustration of a DNA and
polypeptide sequence used for
mammalian cell expression of VEGF-X.
The predicted VEGF-X signal sequence
is in lower case letters. The
C-terminal V5 epitope and His6 sequences
are underlined.

Figure 20: is an illustration of a DNA and
polypeptide sequence used for
baculovirus/insect cell expression of
VEGF-X. In the polypeptide sequence
the signal sequence is shown in lower
case. The N-terminal peptide tag
added to the predicted mature VEGF-X
sequence is underlined.

Figure 21: is an illustration of a DNA and
polypeptide sequence used for E. coli
expression of VEGF-X. The polypeptide
sequences at the N- and C-termini
derived from the MBP fusion and His6
tag respectively are underlined.

Figure 22: illustrates the disulphide-linked
dimerisation of VEGF-X. Protein
samples were analysed by SDS-PAGE.
Prior to loading the gel, samples were
heated to 95°C for 5 minutes in sample
buffer in the presence (+) or absence
(-) of reducing agent. (A) Samples
from COS cell expression of a
C-terminally V5/His6 peptide-tagged
construct. The left hand panel is
total conditioned medium, the right
hand panel is material purified on
Nickel agarose resin. Reduced monomer
and putative disulphide-linked,
non-reduced dimer are indicated by arrows.
There appears to be proteolysis of the
protein during purification. Gels were
blotted onto nylon membranes and
protein detected with an anti-V5
monoclonal antibody. (B) Samples from
E. coli expression of a maltose-binding
protein/His6 dual fusion construct. M
indicates the molecular weight markers
(Benchmark, Life Technologies). The
gel was stained with Coomassie Blue by
standard procedures. The fusion
protein has an apparent molecular
weight of 80kDa.

Figure 23: illustrates the glycosylation of
VEGF-X. VEGF-X was purified from the
culture supernatant of COS cells
transfected with the pcDNA6/V5-His
construct. Supernatants were
harvested 72h post-transfection and
purified on nickel resin. Samples
were then treated with EndoH (+) or
untreated (-) before SDS-PAGE and
blotting, as described in the legend
to Figure 22.

Figure 24: is an illustration of the DNA and
polypeptide sequence used for E. coli
expression of the VEGF-like domain of
VEGF-X. Polypeptide sequences at the
N-terminus of the protein derived from
the vector are underlined.

Figure 25: shows expression of the VEGF-X VEGF
domain in E. coli. Lane 1 - 10µl broad
range marker (New England Biolabs),
lane 2 - 10µl unreduced sample, lane 3 -
10µl reduced sample. The reduced PDGF
domain protein (lane 3) has an
apparent molecular weight of
approximately 19kDa on SDS-PAGE.



Figure 26: illustrates a DNA and polypeptide
sequence used for E. coli expression
of the CUB-like domain of VEGF-X. The
polypeptide sequence at the N-terminus
derived from the vector-encoded signal
and the introduced His6 tag are
underlined.

Figure 27: shows expression of the VEGF-X CUB
domain in E. coli. The CUB domain
protein was purified on Nickel chelate
resin. The protein migrates at
approximately 23kDa on SDS-PAGE.

Figure 28: illustrates the effect of truncated
VEGF-X (CUB domain) on HUVEC
proliferation. (A) Human Umbilical
Vein Endothelial Cells (one-day
treatment). (B) Human Umbilical Vein
Endothelial Cells (24-hour starving
followed by one-day treatment). (C)
Effect of VEGF-A165 and VEGF-X CUB
domain on the proliferation of HUVEC
(two-day treatment).

Figure 29: depicts the tissue distribution of
VEGF-X mRNA analysed by Northern
blotting and RT-PCR in (A) normal
tissues and (B) tumour tissue and cell
lines.

Figure 30: depicts the partial intron/exon
structure of the VEGF-X gene. (A)
Genomic DNA sequences of 2 exons
determined by sequencing; exon
sequence is in upper case, intron
sequence is in lower case. (B) Shows
the location of splice sites within
the VEGF-X cDNA sequence. The
location of mRNA splicing events is
indicated by vertical lines. The
cryptic splice donor/acceptor site at
nt. 998/999 (diagonal lines) gives
rise to the splice variant forms of
VEGF-X. No splice site information is
given for the region shown in italics.

Figure 31: is a graphic representation of the
effect of FL-VEGF-X on HUVEC
proliferation (24 hour serum
starvation followed by one day
treatment).

Figure 32: is a graphic representation of the
combined effect of truncated VEGF-X
(CUB domain) and human recombinant
VEGF165 on HUVEC proliferation (24 hour
serum starvation followed by two day
treatment).

Figure 33: is a graphic representation of the
combined effect of the CUB domain and
human recombinant bFGF on HUVEC
proliferation (24 hour serum
starvation followed by two day
treatment).

Figure 34: is a graphic representation of the
results of a LDH assay for testing
cytotoxicity of the CUB domain or the
CUB domain with rhVEGF165.

Figure 35: is a graphic representation of the
results obtained from a LDH assay for
testing cytotoxicity of the CUB domain
or CUB domain with rh-bFGF.

A BLAST (Basic Local Alignment Search Tool; Altschul
et al., 1990, J. Mol. Biol. 215, 403-410) search was
performed in the proprietary LifeSeq™ human EST
database (Incyte Pharmaceuticals, Inc., Palo Alto,
CA, USA). BLAST produces alignments of both
nucleotide and amino acid sequences to determine
sequence similarity. Because of the local nature of
the alignments, BLAST is especially useful in
determining exact matches or in identifying
homologues. While it is useful for matches which do
not contain gaps, it is inappropriate for performing
motif-style searching. The fundamental unit of BLAST
algorithm output is the High-scoring Segment Pair
(HSP).
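The notion of a High-scoring Segment Pair can be illustrated with a minimal sketch. The code below is not the BLAST algorithm itself (it has no word seeding, extension heuristics or statistics); it simply scans every diagonal of two short sequences for the best-scoring ungapped segment pair. The function name `best_hsp` and the match/mismatch scores of +5/-4 are assumptions made for illustration only.

```python
def best_hsp(query, subject, match=5, mismatch=-4):
    """Return (score, query_start, subject_start, length) of the best
    ungapped segment pair, using 0-based start positions.

    Illustrative only: an exhaustive scan of every diagonal, with
    assumed scores; real BLAST uses seeding and statistical cut-offs.
    """
    best = (0, 0, 0, 0)
    # A diagonal is identified by offset = subject index - query index.
    for offset in range(-(len(query) - 1), len(subject)):
        q0 = max(0, -offset)   # first query position on this diagonal
        s0 = max(0, offset)    # first subject position on this diagonal
        run_score = 0
        run_start = 0          # step at which the current segment begins
        step = 0
        while q0 + step < len(query) and s0 + step < len(subject):
            same = query[q0 + step] == subject[s0 + step]
            run_score += match if same else mismatch
            if run_score <= 0:
                # A non-positive running score cannot help a later
                # segment, so restart just after this position.
                run_score = 0
                run_start = step + 1
            elif run_score > best[0]:
                best = (run_score, q0 + run_start, s0 + run_start,
                        step - run_start + 1)
            step += 1
    return best


if __name__ == "__main__":
    # The 7-letter query occurs exactly once in the subject, at offset 2.
    print(best_hsp("GATTACA", "CCGATTACATT"))  # (35, 0, 2, 7)
```

Because no gaps are allowed, each candidate segment lies on a single diagonal, which is why the text notes that this style of search suits matches which do not contain gaps.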

Eighteen human EST clones (Figure 14) with high
similarity to the previously identified VEGF proteins
were identified, and a further fifty EST clones
(Figure 15) were identified using these sequences as
query sequences, allowing us to deduce the putative
sequence for the new VEGF-X protein. The sequences
obtained were compared to known sequences to
determine regions of homology and to identify the
sequence as a novel VEGF-type protein. Using the DNA
sequence information in the databases we were able to
prepare suitable primers having the sequences of
VEGF-X 1-10 illustrated in Figure 3 for use in
subsequent RACE experiments to obtain the complete
DNA sequence for the VEGF-X gene.

Cloning

A profile was developed based on the VEGF-like domain
in existing VEGF sequences (VEGF-A, B, C and D).
This was used to search the public databases and the
Incyte LifeSeq™ database. No significant novel
matching sequences were found in the public
databases. All of the matching sequences found in
the LifeSeq™ database (~1000) were assembled to give
a smaller number of sequences (~30), which included
the known VEGFs and a potential novel VEGF (figures
1 and 2). This sequence was named VEGF-X.

Oligonucleotides were designed to amplify the VEGF-X
sequence from cDNA (figure 3). The ESTs found in
LifeSeq™ were from a range of tissues, with a slight
predominance of sequences from ovary, testis,
placenta and lung (Figures 14 and 15). Accordingly
the oligonucleotides were used to amplify cDNA
derived from lung and placenta. First-round PCR
products were found at ~200bp larger than the
expected sizes, while 3 major species appeared after
a second round of PCR amplification, the smallest of
which was of the expected size. These fragments were
cloned and sequenced. The smallest fragment did
indeed have the sequence originally identified from
the LifeSeq™ database, while the others contained
insertions (figure 4).

As the first round of amplification suggested that
the major species found in cDNA from ovary and
placenta was not that originally identified in the
LifeSeq™ database, the focus of effort was switched
to the presumed major species (it seemed likely that
clones 57, 25-27 and 2.1kb clones 1-3 in figure 4
represented the major mRNA species). Conceptual
translation of the DNA sequences of these cloned PCR
fragments indicated that the complete open reading
frame was not present in the clones or in the
sequence from LifeSeq™. While all clones contained
the same sequence in the region of the translation
termination codon, indicating that the end of the
open reading frame had been identified, the 5' end of
the open reading frame had not been cloned. 5' RACE
experiments were therefore carried out in order to
find the start of the reading frame. PCR primers
designed for RACE experiments are shown in figure 5.
RACE PCR products were sequenced directly. Sequence
could be obtained from the 3' end of these RACE
products but not from the 5' end, probably because
the products were not cloned and were therefore
heterogeneous at the 5' end. This new sequence was
assembled with the existing cloned sequence to give
the sequence shown in figure 6. Searching the
LifeSeq™ database with this sequence identifies ESTs
which extend the sequence a further 140bp in the 5'
direction and a further 160bp in the 3' direction
(figure 7). This longer contig was used to design
oligonucleotide primers to amplify the entire coding
sequence (these primer sequences are shown in figure
8). PCR was carried out using primers 5'-1 and
vegfX10 (in order to clone a "full-length" cDNA), and
with primers 5'-1 and vegfX6 (in order to clone the
full coding region; see figure 3 for sequences of
vegfX10 and vegfX6). A number of clones were
obtained for the shorter fragment, of which clones 4
and 7 contain no PCR errors (sequence of clones 4 & 7
in figure 9). A single clone was obtained for the
longer fragment (clone 9), but this sequence appears
to contain 2 PCR errors.

The predicted polypeptide from these longer contigs
is shown in figure 10. Amino acids 1-22 are
predicted to form a signal sequence (von Heijne,
1986, Nucleic Acids Res. 14, 4683-4690). Figure 11
shows an alignment of the protein sequence with VEGFs
A-D. The region homologous to the other VEGFs is
located towards the C-terminus of the protein. As
the VEGF homology domain is expected to belong to the
TGF-beta superfamily of growth factors and to consist
of a dimer containing both intra- and intermolecular
disulphide bonds, initial alignments focussed on the
cysteines. However, mapping of the sequence onto the
known x-ray structure of the VEGF-A receptor-binding
domain (Muller et al. (1997) Proc. Natl. Acad. Sci. USA
94, 7192-7197) suggests that the alignment in figure
11 is plausible, as the extra 4 cysteine residues
within the VEGF-homology region of VEGF-X (compared
to this region of VEGF-A) correspond to residues
which are spatially close in VEGF-A, and may
therefore be able to form disulphide bonds.
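The conceptual translation step used above to judge whether a cloned fragment carries a complete open reading frame can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used: it applies the standard genetic code to one chosen reading frame, and the helper names `translate` and `orf_status` and the toy input sequences are hypothetical. Treating any in-frame ATG upstream of the first stop codon as a candidate start is a deliberate simplification.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, frame=0):
    """Translate dna in the given frame (0, 1 or 2); '*' marks a stop
    codon and 'X' marks a codon containing a non-ACGT character."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def orf_status(dna, frame=0):
    """Report whether the frame contains a stop codon and whether a
    methionine codon precedes the first stop (simplified criterion)."""
    protein = translate(dna, frame)
    has_stop = "*" in protein
    upstream = protein[:protein.index("*")] if has_stop else protein
    return {"has_start": "M" in upstream, "has_stop": has_stop}


if __name__ == "__main__":
    print(translate("ATGGCCTAA"))   # MA*
    # A toy fragment truncated at its 5' end: stop present, no start.
    print(orf_status("GCCGGATAA"))  # {'has_start': False, 'has_stop': True}
```

A fragment that reports a stop codon but no upstream start corresponds to the situation described in the text, where the 3' end of the reading frame had been identified but the 5' end had not yet been cloned.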

A search of the PFAM database of protein domains with
the full-length polypeptide sequence from figure 10
identifies two domain consensus sequences within the
polypeptide. The more C-terminal domain is a "VEGF"
domain (the known VEGFs all contain this domain and
the structure of this region of VEGF-A is similar to
that of PDGF). Additionally, towards the N-terminus
of the polypeptide there is a CUB domain (amino acids
~40-150). The CUB domain is a 100-110 amino acid
extracellular domain found in a number of
developmentally-regulated proteins. When the
full-length protein is used to search the protein
databases using the BLAST 2 algorithm, the scores for
matches to CUB domain-containing proteins are more
significant than those to the other VEGFs.
Interestingly, the most significant matches are to
the CUB domains of Neuropilins, and Neuropilin-1 was
recently identified as a receptor of one of the
VEGF-A isoforms, VEGF-A165 (Soker et al. (1998) Cell 92,
735-745).

Assuming that the variant sequences isolated by PCR
(i.e. the smaller PCR fragments) use the same
translation initiation site as the full-length
sequence, they would result in production of the
variant proteins shown in figure 12. It may be
significant that both of these variant proteins
retain the CUB domain and delete all or part of the
VEGF-like domain. The production of these variant
sequences can be explained by the use of a cryptic
splice donor/acceptor site within the VEGF-X sequence
(figure 30B, between nt. 998/999): one variant arises
by splicing out of the region between nt. 729-998,
the other by splicing out of the region between nt.
999-1187.
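The two splicing events can be expressed as simple coordinate arithmetic. The sketch below assumes that the quoted nucleotide positions are 1-based and inclusive, which the text does not state explicitly; the function names and the toy sequence are hypothetical, and the real cDNA sequence (figure 9) is not reproduced here.

```python
def region_length(start, end):
    """Length of a 1-based, inclusive nucleotide region."""
    return end - start + 1


def splice_out(seq, start, end):
    """Return seq with the 1-based, inclusive region start..end removed."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("region must lie within the sequence")
    return seq[:start - 1] + seq[end:]


# Regions quoted in the text (assumed 1-based and inclusive).
VARIANT_REGIONS = {"variant 1": (729, 998), "variant 2": (999, 1187)}

if __name__ == "__main__":
    # Toy demonstration on a short placeholder sequence.
    print(splice_out("AAACCCGGGTTT", 4, 6))  # AAAGGGTTT
    for name, (start, end) in VARIANT_REGIONS.items():
        n = region_length(start, end)
        print(name, "removes", n, "nt; multiple of three:", n % 3 == 0)
```

Under the inclusive reading, the removed regions are 270 and 189 nucleotides long. Both are multiples of three, so each deletion would leave the downstream reading frame unchanged; this is an inference from the quoted coordinates rather than a statement made in the source.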

Expression 

Full-length expression constructs
Mammalian cells

Clone 4, containing the full CDS of VEGF-X (see figure
9), was used to generate constructs for expression of
full-length protein. The sequence was amplified by
PCR and cloned into the vector pcDNA6/V5-His so as to
add a C-terminal V5 epitope tag and His6 tag. The DNA
and polypeptide sequence in this vector is shown in
figure 19. Transient expression in COS cells
followed by western blotting and detection via an
anti-V5 mAb demonstrates the secretion of a protein
of ~50K into the medium in transfected cells only
(figure 22A). This construct can also be used to
generate VEGF-X expressing stable CHO cell lines.

Baculovirus/insect cell expression system
For expression in the baculovirus/insect cell system
the DNA encoding the predicted mature VEGF-X
polypeptide sequence was fused to a sequence encoding
a signal derived from melittin, a secreted insect
protein. An N-terminal 6His tag was also added to
facilitate purification. The insert was then cloned
into the baculovirus expression vector pFASTBAC. The
DNA and polypeptide sequence of this construct is
shown in figure 20. Infection of Trichoplusia ni Hi5
cells with this recombinant baculovirus results in
the secretion of a protein of approximately 45K into
the medium (data not shown).






E. coli

The coding region of VEGF-X has been cloned in a
variety of ways for expression as a secreted protein
in E. coli. A particularly useful expression clone
carries an N-terminal fusion to the E. coli
maltose-binding protein (MBP; derived from the
expression vector pMAL-p2, New England Biolabs) and a
C-terminal fusion to a 6His tag. The DNA and
polypeptide sequence of this vector is shown in
figure 21. Sequential purification of cell fractions
on Ni-NTA resin and amylose resin allows the
isolation of the expressed protein (see figure 22B).

Expression of fragments 

VEGF 

The VEGF domain of VEGF-X has been expressed in
E. coli. Similar domains from VEGF-A (Christinger et
al. (1996) PROTEINS: Structure, Function and Genetics
26, 353-357) and VEGF-D (Achen et al. (1998) Proc.
Natl. Acad. Sci. USA 95, 548-553) have been shown to
be capable of binding to the respective receptors.
Expression of these domains was carried out using the
bacterium E. coli. Additionally, the full-length
protein was expressed using the baculovirus/insect
cell expression system. The oligonucleotide primers
which were used for these experiments are
shown in figure 13. The construct directed
expression in the bacterial cytoplasm, and as
expected the protein was produced in insoluble form
in inclusion bodies (the DNA and polypeptide sequence
used for PDGF domain expression is shown in figure
24). Inclusion bodies were washed, solubilized with
urea and the protein purified under denaturing
conditions, before refolding by dialysis to remove
the urea. Soluble protein was obtained, but shows
little evidence of the disulphide bond linked dimers
seen with material derived from animal cells (figure
25, compare with figure 22A & B). It is not clear
therefore whether this protein is correctly folded.

CUB 

The CUB domain has been expressed as a soluble
secreted protein in E. coli (figure 26). The protein
was purified by binding to Ni-NTA resin (figure 27)
and assayed for activity on HUVECs in an in vitro
proliferation assay.

Properties of the VEGF-X protein
The transient mammalian cell expression system
described above has been used to generate full-length
VEGF-X protein, as shown by antibody detection
following Western blotting (see figure 22A).

Disulphide bond linked dimers

The other members of the PDGF family of growth
factors, the PDGFs and VEGFs, all exist as dimers in
which two monomers constituting the dimer are linked
by interchain disulphide bonds. The x-ray structures
of PDGF-BB (Oefner et al, 1992) and VEGF-A (Muller
et al, 1997) are known and indicate that at least
these two members of the family contain two
interchain disulphide bonds. Practically this means
that in SDS-PAGE analysis of these growth factors the
presence of interchain disulphide bonds is shown by a
large decrease in mobility in the absence of reducing
agent (i.e. the nonreduced dimer migrates more slowly
through the gel than the reduced monomer). This
effect was also expected for VEGF-X, and has been
demonstrated for the material obtained from transient
mammalian cell expression (figure 22A). In the case
of the full length material produced in E. coli only
some 10% of the total VEGF-X protein appears to be
present as disulphide bond-linked dimers (figure
22B). However, these results provide evidence that
the mammalian cell-derived protein is correctly
folded, and that a portion of the E. coli-derived
protein is too.

Glycosylation

There are 3 predicted potential N-linked
glycosylation sites within the VEGF-X protein: at
residues 25, 55 and 254 of the polypeptide sequence.
The predicted molecular mass of the mature VEGF-X
protein is 40kDa, but SDS-PAGE and western blotting
(detection via an introduced C-terminal epitope tag;
see figure 19) of the full-length protein expressed
in COS cells gives a band slightly larger than the
expected size (45-50kDa) as well as one at 25kDa
(figure 22A). This smaller band is presumed to be a
C-terminal proteolysis fragment derived from the
full-length molecule (controls from uninfected cells
do not show this band), probably corresponding to a
cleavage between the CUB and VEGF domains. EndoH
treatment of the preparation gives a slight mobility
change for the full-length protein (figure 23), but
for the smaller VEGF domain fragment there is a clear
change, indicating that the predicted glycosylation
site within the VEGF domain at residue 254 is indeed
glycosylated.
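Potential N-linked glycosylation sites are conventionally predicted by scanning for the sequon Asn-X-Ser/Thr, where X is any residue except proline. The sketch below shows one way such a scan could be written; the source does not say which tool was used for the prediction, the function name `find_sequons` is hypothetical, and the peptide in the example is a toy sequence, not VEGF-X.

```python
import re

# Asn, then any residue except Pro, then Ser or Thr. The lookahead
# allows overlapping sequons (e.g. "NNTS") to be reported separately.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return the 1-based positions of Asn residues that start an
    N-X-S/T sequon (X not Pro) in a one-letter protein sequence."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Toy peptide: NGS and NKT qualify, NPT is excluded by the proline.
    print(find_sequons("MNGSAANPTQNKT"))  # [2, 11]
```

A sequon only marks a potential site; whether it is actually glycosylated has to be tested experimentally, as was done here with EndoH for the site at residue 254.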

Activity of proteins in cell-based assays
Protein samples were tested for activity in cell
proliferation, cell migration and in vitro
angiogenesis assays. Active samples can also be
tested in the in vivo matrigel mouse model of
angiogenesis.

Full-length VEGF-X protein

Conditioned medium derived from COS cells transiently
expressing VEGF-X (see figure 22A) displayed no
detectable activity in any of the assays. However,
as VEGF-X protein could only be detected in this
preparation by Western blotting, and not by
Coomassie-staining of gels, it is clearly present at
very low levels and this may be the reason for the
observed lack of activity in the cell proliferation,
migration or in vitro angiogenesis tests.

VEGF domain 

The VEGF domain protein described above has been
tested in cell proliferation (on a range of cell
types), cell migration and in vitro angiogenesis
assays and has failed to show activity in any of
these tests. As suggested above, this may be due to
incorrect folding of this protein.



CUB domain

The CUB domain protein at the highest dose tested
(1µg/ml) appears to inhibit proliferation of HUVECs
in the absence of other stimulation (figure 28A & B).
This effect is also seen following stimulation with
the lowest VEGF-A165 dose tested (1ng/ml; figure 28C).
The CUB domain of VEGF-X therefore appears to show
antiproliferative activity on HUVECs, even in the
presence of low VEGF-A165 doses.

Tissue distribution of mRNA 
VEGF-A mRNA expression has been shown to be
upregulated in a wide variety of human tumors (lung,
breast, ovarian, colon, stomach, liver, pancreas,
kidney, bladder and prostate; Takahashi et al, 1995).
Tumor VEGF-A expression has been shown to correlate
with tumor growth rate, microvascular density and
tumor metastasis (Takahashi et al, 1995). It was
thus of interest to examine the mRNA expression
patterns of VEGF-X. Accordingly, Northern blot
analysis of mRNA derived from different tissues has
been carried out. The results indicate that although
the VEGF-X mRNA is expressed at low levels, it is
present in a wide range of tissues. PCR
amplification of cDNA from a range of tissue sources
supports this idea (figure 29A). The major mRNA
species is approximately 3.1kb in size. There is
no significant upregulation seen in tumour cell lines
or in tumour tissues tested (figure 29B), with the
possible exception of the cell lines GI-117 (lung
carcinoma) and SaOS-2 (osteosarcoma). The results of
these initial tissue distribution studies do not,
therefore, provide evidence for upregulation of
VEGF-X in tumour growth, as is seen with VEGF-A.

Genomic structure of the VEGF-X gene

A genomic BAC clone covering the 3' part of the
VEGF-X locus was isolated by hybridisation screening
of nylon filters containing a human BAC library.
Direct sequencing of this clone using oligonucleotide
primers based on the VEGF-X cDNA sequence allowed the
determination of several intron/exon boundaries
(figure 30). Interestingly, the position of the mRNA
splice site within the PDGF domain (nt 1187/1188 in
figure 30B) is conserved with respect to those in the
VEGF-A and VEGF-D genes (Tischer et al, 1991;
Rocchigiani et al, 1998).

Materials & Methods 

PCR, cloning, DNA sequence determination and BAC
screening

All primers were purchased from Eurogentec, Seraing,
Belgium. Insert-specific sequencing primers (15- and
16-mers) were designed by visual inspection of the DNA
sequences. DNA was prepared on Qiagen-tip-20 columns or
on Qiaquick spin columns (Qiagen GmbH, Düsseldorf,
Germany) and recovered from the spin columns in 30µl
Tris/EDTA buffer (10mM Tris-HCl pH 7.5, 1mM EDTA (sodium
salt)). Sequencing reactions were performed using
BigDye™ Terminator Cycle Sequencing Ready Reaction kits
(Perkin Elmer, ABI Division, Foster City, CA, USA) and
were run on an Applied Biosystems 377 DNA sequencer
(Perkin Elmer, ABI Division, Foster City, CA, USA).
Polymerase chain reactions were carried out according
to standard procedures (Ausubel et al, 1997). The
PCR fragments were cloned into vectors pCR2.1
(Invitrogen, Carlsbad, CA, USA) or pCR-TOPO
(Invitrogen, NL) according to the manufacturer's
instructions. One of those vectors, plasmid
VEGFX/pCR2.1 1TOPO FL, was deposited on 1 March 1999
under Accession No. LMBP 3925. After sequence
determination, the inserts were cloned into the
desired expression vectors (see figures 19, 20, 21,
24 & 26).

A human genomic BAC library (Genome Systems, Inc., St
Louis, MO, USA) was screened by hybridisation to
oligonucleotides derived from the VEGF-X cDNA
sequence, according to the manufacturer's
instructions. BAC DNA was prepared using a Qiagen
plasmid midi kit (Qiagen GmbH, Düsseldorf, Germany)
according to the manufacturer's instructions with
some modifications (after clearing of the lysate from
chromosomal DNA, supernatants from individual
preparations were pooled on a single column (tip
100), and after the 70% EtOH wash, the pellet was
resuspended overnight at 4°C in 100µl TE). 20-mer
sequencing primers were designed based on the known
cDNA sequence, and sequencing carried out as above.

5' RACE

In order to extend the cDNA clone in a 5' direction
RACE reactions were carried out. Since it was known
that the mRNA is present in placenta and skeletal
muscle, Marathon-Ready™ placenta and skeletal muscle
cDNAs were purchased from Clontech (Palo Alto, CA,
USA) and used according to the manufacturer's
instructions. DNA fragments were excised from
agarose gels, purified using QiaQuick PCR
purification columns (Qiagen GmbH, Düsseldorf,
Germany) and sequenced directly.

VEGF-X protein expression and purification
DNA fragments encoding the desired protein sequences
were amplified by PCR and cloned into appropriate
expression vector systems.

For mammalian cell expression, the full coding
sequence was cloned into the vector pcDNA6/V5-His
(Invitrogen, Leek, NL; see figure 19 for construct
sequence), so as to add a C-terminal peptide tag to
assist in detection and purification.

For insect cell expression the sequence of the 
predicted mature polypeptide was initially amplified 
to add an N-jterminal 6His peptide and then cloned 
into the pMelBacB vector (Invitrogen, Leek, NL> to 
10 add an insect cell signal sequence. The entire 

insert was then PCR-cloned into the vector pFASTBAC-1 
(LifeTechnologies, Gaithersburg, MA, USA) for 
construction of a baculovirus according to the 
manufacturer's instructions. 

For E. coli expression, the coding region was PCR
amplified to add a C-terminal 6His tag and then
cloned into the vector pMAL-p2 (New England Biolabs,
Beverly, MA, USA). The coding sequence of this
construct is shown in figure 21. The protein was
purified first on Ni-NTA resin (Qiagen GmbH,
Düsseldorf, Germany) and then on amylose resin (New
England Biolabs, Beverly, MA, USA), according to the
manufacturers' instructions.

DNA sequences encoding the CUB and VEGF domain
fragments of VEGF-X were PCR amplified and cloned
into pET22b and pET21a (Novagen, Madison, WI, USA)
respectively. The CUB domain protein was prepared
either from the periplasm or medium of induced
cultures by standard methods (Ausubel et al., 1997).
The protein was initially purified by precipitation
with 20% ammonium sulphate. After overnight dialysis
vs 20 mM Tris-HCl pH 7.5, 100 mM NaCl to remove ammonium
sulphate, the protein was further purified on Ni-NTA
resin as described above. The VEGF domain protein
was expressed in insoluble form, and preparation of
inclusion bodies was carried out using standard
procedures (Ausubel et al., 1997). Inclusion bodies
were dissolved in 6 M guanidine hydrochloride, 20 mM
Tris-HCl pH 8.0, 200 mM NaCl, 1 mM 2-mercaptoethanol,
and purified on Ni-NTA resin (Qiagen GmbH,
Düsseldorf, Germany) according to the manufacturer's
instructions. The protein was refolded by dialysis
against several changes of buffer containing
decreasing concentrations of denaturant.

Analysis of protein glycosylation was carried out 
using EndoH (Roche Molecular Biochemicals, Brussels, 
BE) according to the manufacturer's instructions. 

Cell Proliferation Assay

Human umbilical vein endothelial cells (HUVECs)
(Clonetics, San Diego, CA) were trypsinized with
0.05% trypsin/0.53 mM EDTA (Gibco, Gaithersburg, MD),
resuspended in EGM-2 (Clonetics, San Diego, CA),
counted, and distributed in a 96-well tissue culture
plate at 5,000 cells/well. Following cell attachment
and monolayer formation (16 hours), cells were
stimulated with various concentrations of truncated
VEGF-X (CUB domain or VEGF domain) or dilutions of
culture supernatants of the full-length VEGF-X (COS 7
or HEK293) in DMEM (Gibco, Gaithersburg, MD)
containing 0.5% to 2% FBS (HyClone, Logan, UT) as
indicated. For human fetal dermal fibroblasts
(American Type Culture Collection, Rockville, MD),
the growth medium was replaced by DMEM containing
0.1% BSA (Sigma, St. Louis, MO) with or without
various concentrations of truncated VEGF-X proteins.
For HCASMC (Clonetics, San Diego, CA), the medium
was replaced by DMEM containing 0.5% FBS. The cells
were treated for a further 24 hr-72 hr. For the
measurement of proliferation, the culture media were
replaced with 100 µl of DMEM containing 5% FBS and 3
µCi/ml of [3H]-thymidine (Amersham, Arlington
Heights, IL). Following pulse labeling, cells were
fixed with methanol/acetic acid (3:1, vol/vol) for 1
hour at room temperature. The cells were washed
twice with 250 µl/well of 80% methanol. The cells
were solubilized in 0.05% trypsin (100 µl/well) for
30 minutes then in 0.5% SDS (100 µl/well) for another
30 minutes. Aliquots of cell lysates (180 µl) were
combined with 2 ml of scintillation cocktail (Fisher,
Springfield, NJ) and the radioactivity of cell
lysates was measured using a liquid scintillation
counter (Wallac 1409). In each case, samples were
assayed in quadruplicate.
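The quadruplicate scintillation counts from such an assay are typically reduced to a mean per condition and compared with an untreated control. The following is a minimal sketch of that reduction, not the analysis used by the inventors: the function name, the condition labels and all count values are hypothetical, and reporting the result as a fold of the control mean is an assumption.

```python
from statistics import mean, stdev

def summarize_replicates(cpm_by_condition, control="unstimulated"):
    """Return {condition: (mean_cpm, sd_cpm, fold_vs_control)}.

    cpm_by_condition maps a condition label to its replicate counts
    per minute (cpm); `control` names the reference condition.
    """
    control_mean = mean(cpm_by_condition[control])
    summary = {}
    for condition, counts in cpm_by_condition.items():
        m = mean(counts)
        summary[condition] = (m, stdev(counts), m / control_mean)
    return summary

# Hypothetical quadruplicate counts, for illustration only.
example_cpm = {
    "unstimulated": [1000, 1100, 900, 1000],
    "test protein": [2100, 1900, 2000, 2000],
}

for condition, (m, sd, fold) in summarize_replicates(example_cpm).items():
    print(f"{condition}: mean={m:.0f} cpm, sd={sd:.0f}, fold={fold:.2f}")
```

A fold value below 1 for a treated condition would correspond to inhibition of thymidine incorporation relative to the control.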

Chemotaxis Assay

The chemotactic response of HUVECs was assayed using
a 48-well modified Boyden chamber (NeuroProbe, Cabin
John, MD) and collagen-coated (0.1 mg/ml type I
collagen, Collaborative Biomedical, Bedford, MA)
polycarbonate membrane filters with a pore diameter
of 8 µm (NeuroProbe, Cabin John, MD). Cell
suspensions (15,000/well) were loaded into the upper
part of the chemotaxis chamber and stimulated for 4
hours with rhVEGF165 (0.1-10 ng/ml) (Calbiochem, San
Diego, CA) or various concentrations of truncated
VEGF-X (PDGF domain). Cells remaining on the top of
the membrane were removed. Migration was assessed by
counting the number of cells that migrated to the
lower side of the filter membrane. The membrane was
fixed with 10% formaldehyde for 15 min, followed by
staining with Gill's hematoxylin III (Poly
Scientific, Bay Shore, NY). The assay was performed
in triplicate and six independent high power fields
per well were counted using a light microscope at 250x
magnification. The results were expressed as fold
migration relative to unstimulated cells (BGM containing 0.1% BSA).
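The fold value described above can be derived from the raw field counts as sketched here. This is an illustrative calculation only: it assumes that the six field counts are first averaged per well and that the well means are then averaged per condition, which the text does not specify, and all counts shown are hypothetical.

```python
from statistics import mean

def mean_cells_per_field(wells):
    """Average the per-field counts within each well, then across wells."""
    return mean([mean(fields) for fields in wells])

def migration_fold(stimulated_wells, unstimulated_wells):
    """Migration of stimulated cells as a fold of unstimulated cells.

    Each argument is a list of wells; each well is a list of cell
    counts, one per high power field.
    """
    return mean_cells_per_field(stimulated_wells) / mean_cells_per_field(unstimulated_wells)

# Hypothetical counts: three wells per condition, six fields per well.
unstimulated = [
    [10, 12, 8, 10, 11, 9],
    [9, 11, 10, 10, 12, 8],
    [10, 10, 10, 10, 10, 10],
]
stimulated = [
    [24, 26, 25, 25, 27, 23],
    [25, 25, 25, 25, 25, 25],
    [20, 30, 25, 25, 28, 22],
]

print(f"fold migration: {migration_fold(stimulated, unstimulated):.2f}")
```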
In Vitro Angiogenesis Assay

In vitro angiogenesis in fibrin gels was quantitated
using spheroids of human umbilical vein endothelial
cells (Korff et al., 1998). To generate endothelial
cell spheroids of defined size and cell number, a
specific number of cells (~800 cells per spheroid)
was suspended in EGM-2 culture medium containing 20%
methylcellulose (Sigma, St. Louis, MO) and seeded into
nonadherent round-bottom 96-well plates. All
suspended cells in one well contributed to the
formation of a single endothelial cell spheroid
within 24 hours. A fibrin gel stock solution was
prepared freshly prior to use by mixing 3 mg/ml
fibrinogen (Calbiochem, San Diego, CA) in Medium
199 (Gibco, Gaithersburg, MD). Assays were performed
in 24-well culture plates. 1 ml of the fibrinogen stock
was mixed with 50 HUVEC spheroids and the
corresponding test substance, including rh-VEGF165 or
various concentrations of VEGF-X. The
spheroid-containing fibrinogen was rapidly
transferred into 24-well plates. Fifteen microliters
of thrombin (100 NIH U/ml stock, Sigma, St. Louis,
MO) was added to the gel for the fibrin gel
formation. Gel formation usually occurred within
30 seconds. After gel formation, 1 ml/well of Medium
199 supplemented with 20% FBS, 1 mg/ml ε-aminocaproic
acid (Calbiochem, San Diego, CA) and antibiotics
was added. The gel was incubated at 37 °C (5% CO2, 95%
air, 100% humidity). After 3 days, in vitro
angiogenesis was quantitated by measuring the length
of the three longest capillary sprouts that had grown
out of each spheroid (100x magnification), analyzing
at least 10 spheroids per experimental group and
experiment.
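The sprout measurement can be reduced to one score per spheroid and one per experimental group, for example as sketched below. Several points are assumptions made for illustration and are not stated in the text: the three longest sprouts are averaged (rather than summed), a spheroid with fewer than three sprouts is scored with zero for the missing ones, and lengths are given in micrometres. All names and values are hypothetical.

```python
from statistics import mean

def spheroid_score(sprout_lengths_um, n_longest=3):
    """Mean length of the n_longest sprouts of one spheroid.

    Assumption: missing sprouts are counted as length 0.
    """
    longest = sorted(sprout_lengths_um, reverse=True)[:n_longest]
    longest += [0.0] * (n_longest - len(longest))
    return sum(longest) / n_longest

def group_score(spheroids, min_spheroids=10):
    """Mean spheroid score for one experimental group.

    Raises ValueError when fewer than min_spheroids were measured,
    mirroring the requirement of at least 10 spheroids per group.
    """
    if len(spheroids) < min_spheroids:
        raise ValueError(f"need at least {min_spheroids} spheroids, got {len(spheroids)}")
    return mean([spheroid_score(s) for s in spheroids])

# Hypothetical sprout lengths (micrometres) for ten identical spheroids.
example_group = [[50, 120, 80, 100, 30]] * 10
print(f"group score: {group_score(example_group):.1f} um")
```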



Matrigel Mouse Assay
The Matrigel mouse assay is carried out as described
by Passaniti et al. (1992).

Analysis of VEGF-X gene expression by RT-PCR
analysis.

Oligonucleotide primers VEGF-E2 and VEGF-X14 (figure
16; figure 5) were used for the specific PCR
amplification of a 350 bp fragment from VEGF-X. PCR
amplifications were performed on human multiple
tissue cDNA (MTC™) panels (Clontech human MTC panels
I and II and human Tumor MTC panel) normalised to the
mRNA expression levels of six different housekeeping
genes. In addition, cDNA was made from different
tumor cell cultures (Caco-2 colorectal
adenocarcinoma; T-84 colorectal carcinoma; MCF-7
breast adenocarcinoma; T-47D breast ductal gland
carcinoma; HT1080 bone fibrosarcoma; SaOS-2
osteosarcoma; SK-N-MC neuroblastoma; HepG2
hepatoblastoma; JURKAT T-cell leukemia and THP-1
myelomonocytic leukemia). For the preparation of
tumor cell cDNA, cells were homogenised and total RNA
prepared using the RNeasy Mini kit (Qiagen GmbH,
Hilden, Germany) according to the manufacturer's
instructions. 1 µg of total RNA was reverse
transcribed using oligo(dT)15 as a primer and 50 U of
Expand™ Reverse Transcriptase (Boehringer Mannheim,
Mannheim, Germany) according to the manufacturer's
instructions. PCR reactions with VEGF-X-specific or
glyceraldehyde-3-phosphate dehydrogenase
(G3PDH)-specific primers were then performed on 1 µl
of this cDNA. For all cDNAs, PCR reactions with
VEGF-X-specific primers were performed in a total
volume of 50 µl, containing 5 µl (±1 ng) of cDNA, 1x
Advantage KlenTaq PCR reaction buffer, 0.2 mM dNTP,
250 nM of primers VEGF-E2 and VEGF-X14 and 1 µl of
Advantage KlenTaq polymerase mix. Samples were heated
to 95 °C for 30 s and cycling was done for 30 s at
95 °C and 30 s at 68 °C for 25, 30 or 35 cycles.
Control reactions using specific primers that amplify
a 1 kb fragment of the housekeeping gene G3PDH were
also performed according to the manufacturer's
instructions.
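The pipetting volumes for the 50 µl reaction follow from the stated final concentrations once stock concentrations are fixed. The sketch below uses the dilution relation C1 x V1 = C2 x V2. The stock concentrations (10x buffer, 10 mM dNTP, 10 µM for each primer) are assumptions chosen for illustration and are not given in the text, as is the use of water to make up the volume.

```python
TOTAL_VOLUME_UL = 50.0

def stock_volume_ul(final_conc, stock_conc, total_volume_ul=TOTAL_VOLUME_UL):
    """Volume of stock that gives final_conc in total_volume_ul.

    final_conc and stock_conc must be expressed in the same unit.
    """
    return final_conc * total_volume_ul / stock_conc

def reaction_mix():
    """Component volumes (µl) for one 50 µl VEGF-X PCR reaction."""
    mix = {
        "cDNA": 5.0,                                        # stated in the text
        "polymerase mix": 1.0,                              # stated in the text
        "reaction buffer": stock_volume_ul(1, 10),          # 1x from assumed 10x stock
        "dNTP": stock_volume_ul(0.2, 10),                   # 0.2 mM from assumed 10 mM stock
        "primer VEGF-E2": stock_volume_ul(250, 10_000),     # 250 nM from assumed 10 µM stock
        "primer VEGF-X14": stock_volume_ul(250, 10_000),    # 250 nM from assumed 10 µM stock
    }
    # Water makes up the remainder of the reaction volume.
    mix["water"] = TOTAL_VOLUME_UL - sum(mix.values())
    return mix

for component, volume in reaction_mix().items():
    print(f"{component:>16}: {volume:5.2f} µl")
```

With other stock concentrations only the arguments to `stock_volume_ul` need to change; the water volume adjusts automatically.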

Northern blot analysis of VEGF-X. 

Northern blots containing 2 µg of poly(A)-rich RNA
derived from different human tissues (Clontech
Laboratories; MTN™ blot, MTN™ blot II and Cancer Cell
Line MTN™ blot) were hybridized according to the
manufacturer's instructions with an α-[32P]-dCTP
random-priming labelled (Multiprime labelling kit,
Roche Diagnostics) 293 bp specific VEGF-X fragment
(PiiOX-StuI fragment including 92 bp of the 3' end
coding region and 201 bp of the 3' untranslated
region of VEGF-X). The blots were hybridized
overnight at 68 °C and final washes at high stringency
were at 68 °C in 0.1x SSC/0.1% SDS. The membranes
were autoradiographed for 1 to 3 days with
intensifying screens.






Full length VEGF-X 

The effect of full length VEGF-X on proliferation of
HUVECs was determined by the [3H]-thymidine
incorporation assay. HUVECs were serum starved
for 24 hours prior to treatment with the full length
VEGF-X at concentrations ranging from 100 pg/ml to 10
µg/ml. There was no effect of VEGF-X at 100 pg/ml to 10
ng/ml on endothelial cell proliferation. At the
higher concentrations of FL-VEGF-X (100 ng/ml and 1
µg/ml) there was a marked inhibition of endothelial
cell proliferation. This is probably due to the very
high endotoxin level in the samples. The VEGF-X
sample was purified in order to decrease the
endotoxin level and is currently being tested in the cell
proliferation assay.

The Summary from Testing the CUB Domain
The effect of the CUB domain on inhibition of HUVEC
proliferation, either serum- (2%), rh-VEGF- or bFGF-
stimulated, was assessed by the [3H]-thymidine
incorporation assay. Cells were serum starved
followed by treatment with the CUB domain and
various growth factors. Results showed that the CUB
domain inhibited endothelial cell proliferation,
either serum- (2%), rh-VEGF- or bFGF-stimulated, in a
dose dependent manner with maximal inhibition at 10
pg/ml. There was approximately a 2-fold inhibition
of proliferation (at 10 pg/ml) of cells stimulated
with VEGF and bFGF and nearly a 5-fold inhibition of
cells stimulated with serum (2%). Results with the
LDH assay showed that there was no cytotoxicity
associated with the inhibition of cell proliferation
by the CUB domain.

Therefore, the N-terminus of the polypeptide from
Figure 10 has been shown to possess a CUB domain.
When database searches are carried out using the
full-length coding sequence the best matches (i.e.
for a BLAST search, those with the lowest probability
score) are found with the CUB domain rather than with
the VEGF-like domain. The best match from searching
release 37 of the SWISSPROT database (Feb 1999) is to
the CUB domain of a neuropilin from Xenopus laevis,
and the matches to the CUB domains of human
neuropilins 1 and 2 are also more significant than
matches to the VEGFs.

This similarity is provocative, given the
identification of neuropilin-1 and -2 as cellular
receptors for VEGF-A165 (Stoker et al. 1998,
reviewed in Neufeld et al. 1999). It is plausible
therefore that VEGF-X could exert dual regulatory
effects: via interaction with the tyrosine kinase
VEGF receptors mediated by the VEGF-like domain, as
well as via interaction with VEGF isoforms or with
the neuropilin receptors, mediated by the CUB
domain.



To the best of our understanding the latter would be
entirely novel, and searches on the most recent
release of the Incyte database do not reveal any
other proteins containing both CUB and VEGF-like
domains. This arrangement of domains suggests
possible positive or negative models of regulation:

Positive: the VEGF-like domain is able to interact
productively with the tyrosine kinase VEGF receptors
giving activation, and the CUB domain is able to
interact productively with the neuropilin receptor
giving activation.

Negative: the VEGF-like domain does not interact
productively with the tyrosine kinase VEGF receptors,
either preventing receptor dimerisation or blocking
the VEGF binding sites. Further, the CUB domain does
not interact productively with the neuropilin
receptors, either preventing receptor activation or
blocking the VEGF binding sites, or indeed by binding
to VEGF isoforms and preventing their interaction
with receptors.
TABLE 1

ORIGINAL RESIDUE    EXEMPLARY SUBSTITUTIONS

ALA                 SER, THR
ARG                 LYS
ASN                 HIS, SER
ASP                 GLU, ASN
CYS                 SER
GLN                 ASN, HIS
GLU                 ASP, GLU
GLY                 ALA, SER
HIS                 ASN, GLN
ILE                 LEU, VAL, THR
LEU                 ILE, VAL
LYS                 ARG, GLN, GLU, THR
MET                 LEU, ILE, VAL
PHE                 LEU, TYR
SER                 THR, ALA, ASN
THR                 SER, ALA
TRP                 ARG, SER
TYR                 PHE
VAL                 ILE, LEU, ALA
PRO                 ALA
SEQUENCE LISTING

Sequence ID No 1 corresponds to the amino acid
sequence from position 23 to 345 of the amino acid
sequence illustrated in Figure 10.

Sequence ID No 2 is the amino acid sequence
illustrated in Figure 10.

Sequence ID No 3 corresponds to the sequence from
position 257 to 1291 of the nucleotide sequence
illustrated in Figure 9.

Sequence ID No 4 corresponds to the polynucleotide
sequence of VEGFX1 illustrated in Figure 3.

Sequence ID No 5 corresponds to the polynucleotide
sequence of VEGFX2 illustrated in Figure 3.

Sequence ID No 6 corresponds to the polynucleotide
sequence of VEGFX3 illustrated in Figure 3.

Sequence ID No 7 corresponds to the polynucleotide
sequence of VEGFX4 illustrated in Figure 3.

Sequence ID No 8 corresponds to the polynucleotide
sequence of VEGFX5 illustrated in Figure 3.

Sequence ID No 9 corresponds to the polynucleotide
sequence of VEGFX6 illustrated in Figure 3.

WO 00/37641    PCT/US99/30503

Sequence ID No 10 corresponds to the polynucleotide
sequence of VEGFX7 illustrated in Figure 3.

Sequence ID No 11 corresponds to the polynucleotide
sequence of VEGFX8 illustrated in Figure 3.

Sequence ID No 12 corresponds to the polynucleotide
sequence of VEGFX9 illustrated in Figure 3.

Sequence ID No 13 corresponds to the polynucleotide
sequence of VEGFX10 illustrated in Figure 3.

Sequence ID No 14 corresponds to the polynucleotide
sequence of VEGFX11 illustrated in Figure 4.

Sequence ID No 15 corresponds to the polynucleotide
sequence of VEGFX12 illustrated in Figure 4.

Sequence ID No 16 corresponds to the polynucleotide
sequence of VEGFX13 illustrated in Figure 4.

Sequence ID No 17 corresponds to the polynucleotide
sequence of VEGFX14 illustrated in Figure 4.

Sequence ID No 18 corresponds to the polynucleotide
sequence 5'-1 in Figure 8.

Sequence ID No 19 corresponds to the polynucleotide
sequence 5'-2 in Figure 8.

Sequence ID No 20 corresponds to the polynucleotide
sequence of VEGFX6 illustrated in Figure 13.

Sequence ID No 21 corresponds to the polynucleotide
sequence of VEGFX7 illustrated in Figure 13.

Sequence ID No 22 corresponds to the polynucleotide
sequence of VEGFX8 illustrated in Figure 13.

Sequence ID No 23 corresponds to the polynucleotide
sequence of VEGFX9 illustrated in Figure 13.

Sequence ID No 24 corresponds to the polynucleotide
sequence of VEGBAC1 illustrated in Figure 13.

Sequence ID No 25 corresponds to the polynucleotide
sequence of VEGBAC2 illustrated in Figure 13.

Sequence ID No 26 corresponds to a polypeptide
having the amino acid sequence from amino acid
position 40 to 150 of the sequence of Figure 10.

Sequence ID No 27 corresponds to a polypeptide
having the amino acid sequence illustrated in
Figure 26.

Sequence ID No 28 corresponds to the sequence from
position 5 to 508 of the nucleotide sequence
illustrated in Figure 26.

Sequence ID No 29 corresponds to the nucleotide
sequence from position 5 to 508 of the nucleotide
sequence illustrated in Figure 26.

Sequence ID No 30 corresponds to the sequence from
position 214 to 345 of the nucleotide sequence
illustrated in Figure 10.
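The positions quoted in the sequence listing are 1-based and inclusive, so a range "from position a to b" contains b - a + 1 residues. The sketch below shows how such ranges map onto 0-based, end-exclusive slices and checks the lengths they imply. The helper names are hypothetical and the example string is a placeholder, not a sequence from the figures.

```python
def span_length(start, end):
    """Number of residues in a 1-based, inclusive range."""
    return end - start + 1

def subsequence(sequence, start, end):
    """Extract a 1-based, inclusive range from a sequence string."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range lies outside the sequence")
    return sequence[start - 1:end]

# Ranges quoted in the sequence listing.
ranges = {
    "Seq ID 1 (amino acids, Figure 10)": (23, 345),
    "Seq ID 3 (nucleotides, Figure 9)": (257, 1291),
    "Seq ID 26 (amino acids, Figure 10)": (40, 150),
    "Seq ID 28 (nucleotides, Figure 26)": (5, 508),
}
for label, (start, end) in ranges.items():
    print(f"{label}: {span_length(start, end)} residues")

# Placeholder string used only to show the slicing convention.
print(subsequence("ABCDEFGHIJ", 3, 5))
```

The nucleotide range 257 to 1291 spans 1035 bases, that is 345 complete codons, which is consistent with an amino acid sequence ending at position 345; the range 5 to 508 spans 504 bases, or 168 codons.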
CLAIMS

1. A nucleic acid molecule encoding a VEGF-X
protein or a functional equivalent, derivative or
bioprecursor thereof, said protein comprising any of
the sequences from position 23 to 345 of the amino
acid sequence illustrated in Figure 10, or the
complete sequence as illustrated in Figure 10.

2. A nucleic acid molecule according to claim 1
wherein said nucleic acid is a DNA molecule.

3. A nucleic acid molecule according to claim 1 or
2 wherein said nucleic acid is a cDNA molecule.

4. A nucleic acid molecule according to claim 3
comprising the nucleotide sequence from position 257
to 1291 of the nucleotide sequence illustrated in
Figure 9, or sequences that hybridise thereto under
high stringency conditions or the complement thereto.

5. An antisense molecule capable of hybridising to 
a molecule according to any of claims 1 to 4 under 
high stringency conditions. 

6. A nucleic acid molecule according to any of
claims 1 to 4 which is of mammalian origin.

7. A nucleic acid molecule according to claim 6
which is of human origin.

8. An isolated VEGF-X protein, or a functional
equivalent, derivative or bioprecursor thereof,
having an amino acid sequence from position 23 to 345
of the amino acid sequence illustrated in Figure 10
or the complete amino acid sequence of Figure 10.

9. A VEGF-X protein, or a functional equivalent,
derivative or bioprecursor thereof, encoded by a
nucleic acid molecule as defined in any of claims 1
to 4.

10. A protein according to claim 9, which comprises 
the amino acid sequence illustrated in Figure 10. 

11. An expression vector comprising a nucleic acid 
molecule according to any of claims 1 to 4. 

12. An expression vector according to claim 11 
further comprising a nucleotide sequence encoding a 
reporter molecule. 

13. An expression vector comprising an antisense 
molecule according to claim 5. 

14. A nucleic acid molecule according to any of 
claims 1 to 4 or an antisense molecule according to 
claim 5 for use as a medicament. 

15. A host cell transformed or transfected with an 
expression vector according to claim 11 or 12. 

16. A host cell transformed or transfected with an 
expression vector according to claim 13. 

17. A transgenic cell, tissue or organism comprising 
a transgene capable of expressing a VEGF-X protein 
according to claim 8 or 9. 

18. A transgenic cell, tissue or organism according
to claim 17, wherein said transgene is included in an
expression vector.

19. A VEGF-X protein or a functional equivalent,
derivative or bioprecursor thereof, expressed by a
cell according to claim 15.

20. A VEGF-X protein, or a functional equivalent,
derivative or bioprecursor thereof, expressed by a
transgenic cell, tissue or organism according to
claim 17.

21. A process for producing a VEGF-X protein
according to any of claims 8 to 10, said process
comprising transforming a host cell or organism with
an expression vector according to claim 11, and
recovering the expressed protein from said host cell
or organism.

22. An antibody capable of binding to a protein
according to any of claims 8 to 10, or an epitope
thereof.

23. An antibody according to claim 22 for use as a
medicament.

24. A pharmaceutical composition comprising an
antibody according to claim 22 together with a
pharmaceutically acceptable carrier, diluent or
excipient therefor.

25. A method of identifying VEGF-X protein in a
sample which method comprises contacting said sample
with an antibody according to claim 22 and monitoring
for binding of any protein to said antibody.

26. A kit for identifying the presence of VEGF-X
protein in a sample which comprises an antibody
according to claim 22 and means for contacting said
antibody with said sample.

27. A method of identifying compounds which modulate
angiogenesis which method comprises providing a host
cell or organism according to claim 15 or a
transgenic cell, tissue or organism according to
claim 17, contacting a test compound with said cell,
tissue or organism and monitoring for an effect of
said compound on said VEGF compared to a host cell or
organism according to claim 15 or a transgenic cell,
tissue or organism according to claim 17 which has
not been contacted with said compound.

28. A compound identifiable according to the method 
of claim 27. 

29. A compound according to claim 28 for use as a
medicament.

30. A nucleic acid sequence comprising the
nucleotide sequences illustrated in any of Figures 3,
5, 8 or 13.

31. A method for producing a polypeptide, said
method comprising the steps of:

a) culturing the host cell of claim 15 under
conditions suitable for expression of the
polypeptide; and
b) recovering the polypeptide from the host
cell culture.

32. A method of inhibiting angiogenic activity and
inappropriate vascularisation including formation and
proliferation of new blood vessels, growth and
development of tissues, tissue regeneration and organ
and tissue repair in a subject, said method comprising
administering to said subject an amount of an
antisense molecule according to claim 5 in sufficient
concentration to reduce or prevent said angiogenic
activity.

33. A method of inhibiting angiogenic activity or
inappropriate vascularisation including any of
formation and proliferation of new blood vessels,
growth and development of tissues, tissue
regeneration and organ and tissue repair in a subject,
said method comprising administering to said subject
an amount of an antibody according to claim 22 in
sufficient concentration to reduce or prevent said
angiogenic activity or inappropriate vascularisation.

34. A method of inhibiting angiogenic activity or
inappropriate vascularisation including any of
formation and proliferation of new blood vessels,
growth and development of tissues, tissue
regeneration and organ and tissue repair in a
subject, said method comprising implanting in said
subject cells that express an antibody according to
claim 22.

35. A method of treating or preventing any of
cancer, rheumatoid arthritis, psoriasis and diabetic
retinopathy, said method comprising administering to
said subject an amount of an antisense molecule
according to claim 5 in sufficient concentration to
treat or prevent said disorders.

36. A method of treating or preventing any of
cancer, rheumatoid arthritis, psoriasis and diabetic
retinopathy, said method comprising administering to
said subject an amount of an antibody according to
claim 22 in sufficient concentration to reduce or
prevent said disorders.

37. A method of promoting angiogenic activity or
vascularization to promote wound healing, skin graft
growth, tissue repair, proliferation of new blood
vessels, tissue regeneration and organ repair which
method comprises applying or delivering to a site of
interest a therapeutically effective amount of any of
a group selected from a protein according to claim 8
and a nucleic acid molecule encoding a VEGF-X protein
or a functional equivalent, derivative or
bioprecursor thereof comprising an amino acid
sequence illustrated in Figure 10, an expression
vector comprising said nucleic acid molecule and a
pharmaceutical composition comprising any of said
nucleic acid molecule and said protein.

38. A method of treating wounds selected from the
group consisting of dermal ulcers, pressure sores,
venous sores, diabetic ulcers and burns by applying
to said wound a therapeutically effective amount of
any of a VEGF-X protein according to claim 8, a
pharmaceutical composition comprising said protein
and a pharmaceutically acceptable carrier, diluent or
excipient therefor.

39. A nucleic acid molecule encoding a polypeptide
having a CUB domain, said polypeptide comprising the
amino acid sequence from position 40 to 150 of the
sequence of Figure 10.

40. A nucleic acid molecule encoding a polypeptide
having a CUB domain, said polypeptide comprising the
amino acid sequence of Figure 26.

41. A nucleic acid molecule according to claim 39 or 
40, comprising the nucleotide sequence from position 
5 to 508 of the sequence illustrated in Figure 26. 

42. A nucleic acid molecule according to any of
claims 39 to 41 comprising the nucleotide sequence
illustrated in Figure 26.

43. A nucleic acid molecule encoding a VEGF like
domain comprising the sequence from position 214 to 345
of the sequence of Figure 10 or the sequence from
position 15 to 461 illustrated in Figure 24.

44. An expression vector comprising a nucleic acid
molecule according to any of claims 39 to 42.

45. An expression vector comprising a nucleic acid
molecule according to claim 43.

46. A host cell transformed or transfected with an
expression vector according to claim 44.

47. A host cell transformed or transfected with an
expression vector according to claim 45.

48. A protein expressed by the cell according to
claim 46.

49. A protein expressed by the cell according to
claim 47.

50. A method of identifying compounds that inhibit
or enhance angiogenic activity, said method
comprising contacting a cell expressing a VEGF
receptor and/or a neuropilin 1 or 2 type receptor
with said compound in the presence of a VEGF-X
protein according to claim 8 and monitoring for the
effect of said compound on said cell when compared to
a cell which has not been contacted with said
compound.

51. A compound identifiable according to the method
of claim 50 as an inhibitor or enhancer of angiogenic
activity.

52. A method of inhibiting angiogenic activity or
inappropriate vascularisation, said method comprising
contacting a cell expressing a VEGF receptor and a
neuropilin type receptor with a protein selected from
any of a protein according to any of claims 8 to 10
and a protein according to claim 48 or a protein
according to claim 49.

53. Use of a nucleotide sequence illustrated in any 
of Figures 14 and 15 in identifying a VEGF-X protein 
according to claim 8. 

54. A nucleic acid molecule encoding a polypeptide
comprising a CUB domain having the sequence from
position 40 to 150 of the sequence of Figure 10 or
from position 5 to 508 of the sequence of Figure 26
and a sequence encoding a VEGF domain.

55. A nucleic acid molecule according to claim 54
wherein said sequence encoding said VEGF domain is
selected from the sequences encoding any of VEGF A to
D or isoforms or variants thereof.

56. A nucleic acid molecule encoding a polypeptide
comprising the amino acid sequence from position 40
to 150 of the sequence illustrated in Figure 10 for
use as a medicament.



57. Use of a nucleic acid molecule encoding a
polypeptide having the amino acid sequence from
position 40 to 150 of the sequence illustrated in
Figure 10 in the manufacture of a medicament for
treatment of disease conditions associated with
inappropriate angiogenesis such as tumour or cancer
growth, retinopathy, osteoarthritis or psoriasis.

58. A polypeptide comprising the amino acid sequence
from position 40 to 150 of the sequence illustrated
in Figure 10 for use as a medicament.

59. A polypeptide comprising the amino acid sequence
from position 40 to 150 of the sequence illustrated
in Figure 10 in the manufacture of a medicament for
the treatment of disease conditions associated with
inappropriate angiogenesis such as tumour growth,
retinopathy, osteoarthritis or psoriasis.

60. Use of a CUB domain comprising the amino acid
sequence from position 40 to 150 of the sequence of
Figure 10, or the amino acid sequence of Figure 26,
to identify compounds which inhibit angiogenic
activity in a method according to claim 50.

61. A method of inhibiting angiogenic activity and
inappropriate vascularisation including formation and
proliferation of new blood vessels, growth and
development of tissues, tissue regeneration and organ
and tissue repair in a subject, said method comprising
administering to said subject an amount of a
polypeptide having an amino acid sequence from
position 40 to 150 of the sequence illustrated in
Figure 10 or a nucleic acid molecule according to any
of claims 39 to 42 in sufficient concentration to
reduce or prevent said angiogenic activity.

62. A method of treating or preventing any of
cancer, rheumatoid arthritis, psoriasis and diabetic
retinopathy, said method comprising administering to
said subject an amount of a polypeptide having an
amino acid sequence from position 40 to 150 of the
sequence illustrated in Figure 10 or a nucleic acid
molecule according to any of claims 39 to 42 in
sufficient concentration to treat or prevent said
disorders.



63. An antisense molecule capable of hybridising to 
a molecule according to any of claims 39 to 42 under 
high stringency conditions. 

64. An antisense molecule capable of hybridising to 
a molecule according to claim 43 under high 
stringency conditions. 



65. A transgenic cell, tissue or organism comprising
a transgene capable of expressing a protein according
to claim 48.

66. A transgenic cell, tissue or organism comprising
a transgene capable of expressing a protein according
to claim 49.

67. A transgenic cell, tissue or organism
according to claim 65 or 66, wherein said transgene
is included in an expression vector according to
claim 41 or 42.

68. An antibody capable of binding to a protein 
according to claim 48 or an epitope thereof. 

69. An antibody capable of binding to a protein 
according to claim 49 or an epitope thereof. 



70. A pharmaceutical composition comprising an 
antibody according to claim 68 or 69 together with a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 

71. A pharmaceutical composition comprising a 
compound according to claim 48 together with a 
pharmaceutically acceptable carrier, diluent or 
excipient therefor. 






72. A nucleic acid molecule encoding a variant of a 
VEGF-X protein having any of the sequences of 
nucleotides illustrated in Figure 12. 

1 AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG GCTTGAAGAC CCAGAAGATG 
TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC CGAACTTCTG GGTCTTCTAC 

61 ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG TGATGGAACT ATATTAGGGC 
TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC ACTACCTTGA TATAATCCCG 

121 GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC TAAAGGAAAT CAAATTAGGA 
CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG ATTTCCTTTA GTTTAATCCT 

+1 MetAsn IlePheLeu LeuAsnLeuLeu ThrGluGlu ValArgLeu 

181 TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCTTC TAACAGAGGA GGTAAGATTA 
ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGAAG ATTGTCTCCT CCATTCTAAT 

+1 TyrSerCysThr ProArgAsn PheSerVal SerIleArgGlu GluLeuLys ArgThrAsp 

241 TACAGCTGCA CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT 
ATGTCGACGT GTGGAGCATT GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA 

+1 ThrIlePheTrp ProGlyCys LeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys 

301 ACCATTTTCT GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT 
TGGTAAAAGA CCGGTCCAAC AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA 

+1 LeuHisAsnCys AsnGluCys GlnCysVal ProSerLysVal ThrLysLys TyrHisGlu 

361 CTCCACAATT GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG 
GAGGTGTTAA CGTTACTTAC AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC 

+1 ValLeuGlnLeu ArgProLys ThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal 

421 GTCCTTCAGT TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG 
CAGGAAGTCA ACTCTGGTTT CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC 

+1 AlaLeuGluHis HisGluGlu CysAspCys ValCysArgGly SerThrGly Gly 

481 GCCCTGGAGC ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG 
CGGGACCTCG TGGTACTCCT CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC 

541 CATCACCACC AGCAGCTCTT GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA 
GTAGTGGTGG TCGTCGAGAA CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT 

601 ACGTATGCGT TATCTCCATC CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA 
TGCATACGCA ATAGAGGTAG GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT 

661 GGATTTACAG TGCATTCTGA AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC 
CCTAAATGTC ACGTAAGACT TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG 

721 TCTTTTGAGA GGAGGCCTAA AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA 
AGAAAACTCT CCTCCGGATT TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT 

781 AATGTTGTAT TAAATAGATC ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG 
TTACAACATA ATTTATCTAG TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC 

841 CTGGGTTCTG TATTTCAGTT CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA 
GACCCAAGAC ATAAAGTCAA GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT 

901 ACTGTGCAAG TGAGCACCTG ATTCCGTTGC CTTGCTTAAC TCTAAAGCTC CATGTCCTGG 
TGACACGTTC ACTCGTGGAC TAAGGCAACG GAACGAATTG AGATTTCGAG GTACAGGACC 

961 GCCTAAAATC GTATAAAATC TGGATTTTTT TTTTTTTTTT TGCTCATATT CACATATGTA 
CGGATTTTAG CATATTTTAG ACCTAAAAAA AAAAAAAAAA ACGAGTATAA GTGTATACAT 

1021 AACCAGAACA TTCTATGTAC TACAAACCTG GTTTTTAAAA AGGAACTATG TTGCTATGAA 
TTGGTCTTGT AAGATACATG ATGTTTGGAC CAAAAATTTT TCCTTGATAC AACGATACTT 

1081 TTAAACTTGT GTCGTGCTGA TAGGACAGAC TGGATTTTTC ATATTTCTTA TTAAAATTTC 
AATTTGAACA CAGCACGACT ATCCTGTCTG ACCTAAAAAG TATAAAGAAT AATTTTAAAG 

1141 TGCCATTTAG AAGAAGAGAA CTACATTCAT GGTTTGGAAG AGATAAACCT GAAAAGAAGA 
ACGGTAAATC TTCTTCTCTT GATGTAAGTA CCAAACCTTC TCTATTTGGA CTTTTCTTCT 

1201 GTGGCCTTAT CTTCACTTTA TCGATAAGTC AGTTTATTTG TTTCATTGTG TACATTTTTA 
CACCGGAATA GAAGTGAAAT AGCTATTCAG TCAAATAAAC AAAGTAACAC ATGTAAAAAT 

1261 TATTCTCCTT TTGACATTAT AACTGTTGGC TTTTCTAATC TTGTTAAATA TATCTATTTT 
ATAAGAGGAA AACTGTAATA TTGACAACCG AAAAGATTAG AACAATTTAT ATAGATAAAA 

1321 TACCAAAGGT ATTTAATATT CTTTTTTATG ACAACTTAGA TCAACTATTT TTAGCTTGGT 
ATGGTTTCCA TAAATTATAA GAAAAAATAC TGTTGAATCT AGTTGATAAA AATCGAACCA 

1381 AAATTTTTCT AAACACAATT GTTATAGCCA GAGGAACAAA GATGATATAA AATATTGTTG 
TTTAAAAAGA TTTGTGTTAA CAATATCGGT CTCCTTGTTT CTACTATATT TTATAACAAC 

1441 CTCTGACAAA AATACATGTA TTTCATTCTC GTATGGTGCT AGAGTTAGAT TAATCTGCAT 
GAGACTGTTT TTATGTACAT AAAGTAAGAG CATACCACGA TCTCAATCTA ATTAGACGTA 

1501 TTTAAAAAAC TGAATTGGAA TAGAATTGGT AAGTTGCAAA GACTTTTTGA AAATAATTAA 
AAATTTTTTG ACTTAACCTT ATCTTAACCA TTCAACGTTT CTGAAAAACT TTTATTAATT 

1561 ATTATCATAT CTTCCATTCC TGTTATTGGA GATGAAAATA AAAAGCAACT TATGAAAGTA 
TAATAGTATA GAAGGTAAGG ACAATAACCT CTACTTTTAT TTTTCGTTGA ATACTTTCAT 

1621 GACATTCAGA TCCAGCCATT ACTAACCTAT TCCTTTTTTG GGGAAATCTG AGCCTAGCTC 
CTGTAAGTCT AGGTCGGTAA TGATTGGATA AGGAAAAAAC CCCTTTAGAC TCGGATCGAG 

1681 AGAAAAACAT AAAGCACCTT GAAAAAGACT TGGCAGCTTC CTGATAAAGC GTGCTGTGCT 
TCTTTTTGTA TTTCGTGGAA CTTTTTCTGA ACCGTCGAAG GACTATTTCG CACGACACGA 

1741 GTGCAGTAGG AACACATCCT ATTTATTGTG ATGTTGTGGT TTTATTATCT TAAACTCTGT 
CACGTCATCC TTGTGTAGGA TAAATAACAC TACAACACCA AAATAATAGA ATTTGAGACA 

1801 TCCATACACT TGTATAAATA CATGGATATT TTTATGTACA GAAGTATGTC TCTTAACCAG 
AGGTATGTGA ACATATTTAT GTACCTATAA AAATACATGT CTTCATACAG AGAATTGGTC 

1861 TTCACTTATT GTACCTGG 
AAGTGAATAA CATGGACC 
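
Because the sequence is printed as two rows, with the lower strand written directly beneath the upper strand, each pair of rows can be cross-checked base by base. The following is a minimal Python sketch of such a check; the function name `strand_mismatches` and the example blocks are illustrative assumptions and do not form part of the specification.

```python
# Watson-Crick pairing used to compare the printed lower strand with the upper strand.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def strand_mismatches(top, bottom):
    """Return 1-based positions where `bottom` is not the complement of `top`.

    Whitespace between 10-base blocks is ignored. Both rows are read left to
    right exactly as printed, so no reversal is applied.
    """
    top = "".join(top.split()).upper()
    bottom = "".join(bottom.split()).upper()
    if len(top) != len(bottom):
        raise ValueError("strands differ in length")
    return [i + 1 for i, (t, b) in enumerate(zip(top, bottom)) if PAIR.get(t) != b]


# Two consecutive blocks that pair correctly give an empty list.
print(strand_mismatches("GATGAAAATA AAAAGCAACT", "CTACTTTTAT TTTTCGTTGA"))

# A single misread base in the lower row is reported by its position.
print(strand_mismatches("GATGAAAATA", "CTAGTTTTAT"))
```

A non-empty result only shows that the two rows disagree at that position; it does not indicate which of the two rows carries the error.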
Predicted VEGF-like protein encoded by Incyte contig of 8/12/98 

1 MNIFLLNLLT EEVRLYSCTP RNFSVSIREE LKRTDTIFWP GCLLVKRCGG 
51 NCACCLHNCN ECQCVPSKVT KKYHEVLQLR PKTGVRGLHK SLTDVALEHH 
101 EECDCVCRGS TGG 
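
The one-letter polypeptide sequence above corresponds to the three-letter translation printed over the nucleotide sequence. As a hedged illustration of how that correspondence can be reproduced, the Python sketch below translates a short stretch of the open reading frame using the standard genetic code. The helper name `translate` and the choice of example fragments are assumptions made for this sketch only.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, start=0):
    """Translate `dna` from 0-based offset `start` up to the first stop codon.

    Whitespace is ignored; codons containing characters other than A, C, G
    or T are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 15 codons of the open reading frame, starting at the ATG.
print(translate("ATGAATATTT TCCTTCTGAA CCTTCTAACA GAGGAGGTAA GATTA"))  # MNIFLLNLLTEEVRL

# Last three codons followed by the TAG stop codon.
print(translate("ACAGGAGGATAGCCG"))  # TGG
```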



PCR primers for cloning VEGF-X 
vegfX1 AAAATGTATGGATACAACTTAC 

vegfX2 GTTTGATGAAAGATTTGGGCTTG 

vegfX3 TTTCTAAAGGAAATCAAATTAG 

vegfX4 GATAAGATTTGTATCTGATG 

vegfX5 GATGTCTCCTCTTTCAG 

vegfX6 GCACAACTCCTAATTCTG 

vegfX7 AGCACCTGATTCCGTTGC 

vegfX8 TAGTACATAGAATGTTCTGG 

vegfX9 AAGAGACATACTTCTGTAC 

vegfX10 CCAGGTACAATAAGTGAACTG 
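
Some of the primers listed above read in the same sense as the upper strand of the contig, while others are its reverse complement. The Python sketch below shows one way a primer could be placed on a template; it is a minimal illustration, and the function names `reverse_complement` and `locate`, as well as the 60-base template fragment (taken from the contig row numbered 661), are choices made for this example only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def locate(primer, template):
    """Find a primer on a template given as its upper strand.

    Returns ("+", position) if the primer matches the upper strand,
    ("-", position) if its reverse complement does, or None if neither
    matches. Positions are 1-based and refer to the upper strand.
    """
    template = "".join(template.split()).upper()
    primer = primer.upper()
    pos = template.find(primer)
    if pos >= 0:
        return "+", pos + 1
    pos = template.find(reverse_complement(primer))
    if pos >= 0:
        return "-", pos + 1
    return None


# Upper strand of the contig row numbered 661 (60 bases).
fragment = ("GGATTTACAG TGCATTCTGA AAGAGGAGAC "
            "ATCAAACAGA ATTAGGAGTT GTGCAACAGC")

print(locate("GATGTCTCCTCTTTCAG", fragment))   # vegfX5 -> ('-', 17)
print(locate("GCACAACTCCTAATTCTG", fragment))  # vegfX6 -> ('-', 37)
```

Positions are relative to the start of the fragment supplied; adding the row number minus one (here 660) converts them to the numbering used in the figure.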
Variants isolated by PCR (at 8/2/99, all cloned and sequenced at JRF) 



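[Schematic: PCR-derived clones drawn beneath the Incyte contig (8/12/98), with the positions of PCR primers a and b (forward) and c, d, e and f (reverse) marked above. Rows shown: Incyte contig; clones 22, 29, 41; clones 52, 59; clones 15, 20; clones 57, 25, 26, 27; 2.1 kb clones 1, 2, 3.] 

primers (see fig 3): a- vegfX1 b- vegfX2 c- vegfXS d- vegfXS e- vegfX9 f- vegfX10 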
VEGF-X 5' RACE primers 

vegfX11 CCTTTAGAAATCTGTTTTCCTGGTACAG 

vegfX12 GGAAAATATTCATCAGATACAAATCTTATCC 

vegfX13 GGTCCAGTGGCAAAGCTGAAGG 

vegfX14 CTGGTTCAAGATATCGAATAAGGTCTTCC 
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DNA sequence assembled from in-house clones and 5'RACE 



1 TGCCAGAGCA GGTGGGCGCT TCCACCCCAG TGCAGCCTTC CCCTGGCGGT GGTGAAAGAG 
ACGGTCTCGT CCACCCGCGA AGGTGGGGTC ACGTCGGAAG GGGACCGCCA CCACTTTCTC 

61 ACTCGGGAGT CGCTGCTTCC AAAGTGCCCG CCGTGAGTGA GCTCTCACCC CAGTCAGCCA 
TGAGCCCTCA GCGACGAAGG TTTCACGGGC GGCACTCACT CGAGAGTGGG GTCAGTCGGT 

+2 MetSerLeu PheGlyLeuLeu LeuLeuThr SerAlaLeu AlaGlyGlnArg GlnGlyTh 

121 AATGAGCCTC TTCGGGCTTC TCCTGCTGAC ATCTGCCCTG GCCGGCCAGA GACAGGGGAC 
TTACTCGGAG AAGCCCGAAG AGGACGACTG TAGACGGGAC CGGCCGGTCT CTGTCCCCTG 

+2 rGlnAlaGlu SerAsnLeuSer SerLysPhe GlnPheSer SerAsnLysGlu GlnAsnGl 

181 TCAGGCGGAA TCCAACCTGA GTAGTAAATT CCAGTTTTCC AGCAACAAGG AACAGAACGG 
AGTCCGCCTT AGGTTGGACT CATCATTTAA GGTCAAAAGG TCGTTGTTCC TTGTCTTGCC 

+2 yValGlnAsp ProGlnHisGlu ArgIleIle ThrValSer ThrAsnGlySer IleHisSe 

241 AGTACAAGAT CCTCAGCATG AGAGAATTAT TACTGTGTCT ACTAATGGAA GTATTCACAG 
TCATGTTCTA GGAGTCGTAC TCTCTTAATA ATGACACAGA TGATTACCTT CATAAGTGTC 

+2 rProArgPhe ProHisThrTyr ProArgAsn ThrValLeu ValTrpArgLeu ValAlaVa 

301 CCCAAGGTTT CCTCATACTT ATCCAAGAAA TACGGTCTTG GTATGGAGAT TAGTAGCAGT 
GGGTTCCAAA GGAGTATGAA TAGGTTCTTT ATGCCAGAAC CATACCTCTA ATCATCGTCA 

+2 lGluGluAsn ValTrpIleGln LeuThrPhe AspGluArg PheGlyLeuGlu AspProGl 

361 AGAGGAAAAT GTATGGATAC AACTTACGTT TGATGAAAGA TTTGGGCTTG AAGACCCAGA 
TCTCCTTTTA CATACCTATG TTGAATGCAA ACTACTTTCT AAACCCGAAC TTCTGGGTCT 

+2 uAspAspIle CysLysTyrAsp PheValGlu ValGluGlu ProSerAspGly ThrIleLe 

421 AGATGACATA TGCAAGTATG ATTTTGTAGA AGTTGAGGAA CCCAGTGATG GAACTATATT 
TCTACTGTAT ACGTTCATAC TAAAACATCT TCAACTCCTT GGGTCACTAC CTTGATATAA 

+2 uGlyArgTrp CysGlySerGly ThrValPro GlyLysGln IleSerLysGly AsnGlnIl 

481 AGGGCGCTGG TGTGGTTCTG GTACTGTACC AGGAAAACAG ATTTCTAAAG GAAATCAAAT 
TCCCGCGACC ACACCAAGAC CATGACATGG TCCTTTTGTC TAAAGATTTC CTTTAGTTTA 

+2 eArgIleArg PheValSerAsp GluTyrPhe ProSerGlu ProGlyPheCys IleHisTy 

541 TAGGATAAGA TTTGTATCTG ATGAATATTT TCCTTCTGAA CCAGGGTTCT GCATCCACTA 
ATCCTATTCT AAACATAGAC TACTTATAAA AGGAAGACTT GGTCCCAAGA CGTAGGTGAT 

+2 rAsnIleVal MetProGlnPhe ThrGluAla ValSerPro SerValLeuPro ProSerAl 

601 CAACATTGTC ATGCCACAAT TCACAGAAGC TGTGAGTCCT TCAGTGCTAC CCCCTTCAGC 
GTTGTAACAG TACGGTGTTA AGTGTCTTCG ACACTCAGGA AGTCACGATG GGGGAAGTCG 

+2 aLeuProLeu AspLeuLeuAsn AsnAlaIle ThrAlaPhe SerThrLeuGlu AspLeuIl 

661 TTTGCCACTG GACCTGCTTA ATAATGCTAT AACTGCCTTT AGTACCTTGG AAGACCTTAT 

+2 eArgTyrLeu GluProGluArg TrpGlnLeu AspLeuGlu AspLeuTyrArg ProThrTr 

721 TCGATATCTT GAACCAGAGA GATGGCAGTT GGACTTAGAA GATCTATATA GGCCAACTTG 
AGCTATAGAA CTTGGTCTCT CTACCGTCAA CCTGAATCTT CTAGATATAT CCGGTTGAAC 

+2 pGlnLeuLeu GlyLysAlaPhe ValPheGly ArgLysSer ArgValValAsp LeuAsnLe 

781 GCAACTTCTT GGCAAGGCTT TTGTTTTTGG AAGAAAATCC AGAGTGGTGG ATCTGAACCT 
CGTTGAAGAA CCGTTCCGAA AACAAAAACC TTCTTTTAGG TCTCACCACC TAGACTTGGA 

+2 uLeuThrGlu GluValArgLeu TyrSerCys ThrProArg AsnPheSerVal SerIleAr 

841 TCTAACAGAG GAGGTAAGAT TATACAGCTG CACACCTCGT AACTTCTCAG TGTCCATAAG 
AGATTGTCTC CTCCATTCTA ATATGTCGAC GTGTGGAGCA TTGAAGAGTC ACAGGTATTC 

+2 gGluGluLeu LysArgThrAsp ThrIlePhe TrpProGly CysLeuLeuVal LysArgCy 

901 GGAAGAACTA AAGAGAACCG ATACCATTTT CTGGCCAGGT TGTCTCCTGG TTAAACGCTG 
CCTTCTTGAT TTCTCTTGGC TATGGTAAAA GACCGGTCCA ACAGAGGACC AATTTGCGAC 

+2 sGlyGlyAsn CysAlaCysCys LeuHisAsn CysAsnGlu CysGlnCysVal ProSerLy 

961 TGGTGGGAAC TGTGCCTGTT GTCTCCACAA TTGCAATGAA TGTCAATGTG TCCCAAGCAA 
ACCACCCTTG ACACGGACAA CAGAGGTGTT AACGTTACTT ACAGTTACAC AGGGTTCGTT 

+2 sValThrLys LysTyrHisGlu ValLeuGln LeuArgPro LysThrGlyVal ArgGlyLe 

1021 AGTTACTAAA AAATACCACG AGGTCCTTCA GTTGAGACCA AAGACCGGTG TCAGGGGATT 
TCAATGATTT TTTATGGTGC TCCAGGAAGT CAACTCTGGT TTCTGGCCAC AGTCCCCTAA 

+2 uHisLysSer LeuThrAspVal AlaLeuGlu HisHisGlu GluCysAspCys ValCysAr 

1081 GCACAAATCA CTCACCGACG TGGCCCTGGA GCACCATGAG GAGTGTGACT GTGTGTGCAG 
CGTGTTTAGT GAGTGGCTGC ACCGGGACCT CGTGGTACTC CTCACACTGA CACACACGTC 

+2 gGlySerThr GlyGly 

1141 AGGGAGCACA GGAGGATAGC CGCATCACCA CCAGCAGCTC TTGCCCAGAG CTGTGCAGTG 
TCCCTCGTGT CCTCCTATCG GCGTAGTGGT GGTCGTCGAG AACGGGTCTC GACACGTCAC 

1201 CAGTGGCTGA TTCTATTAGA GAACGTATGC GTTATCTCCA TCCTTAATCT CAGTTGTTTG 
GTCACCGACT AAGATAATCT CTTGCATACG CAATAGAGGT AGGAATTAGA GTCAACAAAC 

1261 CTTCAAGGAC CTTTCATCTT CAGGATTTAC AGTGCATTCT GAAAGAGGAG ACATCAAACA 
GAAGTTCCTG GAAAGTAGAA GTCCTAAATG TCACGTAAGA CTTTCTCCTC TGTAGTTTGT 

1321 GAATTAGGAG TTGTGCAACA GCTCTTTTGA GAGGAGGCCT AAAGGACAGG AGAAAAGGTC 
CTTAATCCTC AACACGTTGT CGAGAAAACT CTCCTCCGGA TTTCCTGTCC TCTTTTCCAG 

1381 TTCAATCGTG GAAAGAAAAT TAAATGTTGT ATTAAATAGA TCACCAGCTA GTTTCAGAGT 
AAGTTAGCAC CTTTCTTTTA ATTTACAACA TAATTTATCT AGTGGTCGAT CAAAGTCTCA 

1441 TACCATGTAC GTATTCCACT AGCTGGGTTC TGTATTTCAG TTCTTTCGAT ACGGCTTAGG 
ATGGTACATG CATAAGGTGA TCGACCCAAG ACATAAAGTC AAGAAAGCTA TGCCGAATCC 

1501 GTAATGTCAG TACAGGAAAA AAACTGTGCA AGTGAGCACC TGATTCCGTT GCCTTGCTTA 

1561 ACTCTAAAGC TCCATGTCCT GGGCCTAAAA TCGTATAAAA TCTGGATTTT TTTTTTTTTT 
TGAGATTTCG AGGTACAGGA CCCGGATTTT AGCATATTTT AGACCTAAAA AAAAAAAAAA 

1621 TTTGCTCATA TTCACATATG TAAACCAGAA CATTCTATGT ACTACAAACC TGGTTTTTAA 
AAACGAGTAT AAGTGTATAC ATTTGGTCTT GTAAGATACA TGATGTTTGG ACCAAAAATT 

1681 AAAGGAACTA TGTTGCTATG AATTAAACTT GTGTCGTGCT GATAGGACAG ACTGGATTTT 
TTTCCTTGAT ACAACGATAC TTAATTTGAA CACAGCACGA CTATCCTGTC TGACCTAAAA 

1741 TCATATTTCT TATTAAAATT TCTGCCATTT AGAAGAAGAG AACTACATTC ATGGTTTGGA 
AGTATAAAGA ATAATTTTAA AGACGGTAAA TCTTCTTCTC TTGATGTAAG TACCAAACCT 

1801 AGAGATAAAC CTGAAAAGAA GAGTGGCCTT ATCTTCACTT TATCGATAAG CCAGTTTATT 
TCTCTATTTG GACTTTTCTT CTCACCGGAA TAGAAGTGAA ATAGCTATTC GGTCAAATAA 

1861 TGTTTCATTG TGTACATTTT TATATTCTCC TTTTGACATT ATAACTGTTG GCTTTTCTAA 
ACAAAGTAAC ACATGTAAAA ATATAAGAGG AAAACTGTAA TATTGACAAC CGAAAAGATT 

1921 TCTTGTTAAA TATATCTATT TTTACCAAAG GTATTTAATA TTCTTTTTTA TGACAACTTA 
AGAACAATTT ATATAGATAA AAATGGTTTC CATAAATTAT AAGAAAAAAT ACTGTTGAAT 

1981 GATCAACTAT TTTTAGCTTG GTAAATTTTT CTAAACACAA TTGTTATAGC CAGAGGAACA 
CTAGTTGATA AAAATCGAAC CATTTAAAAA GATTTGTGTT AACAATATCG GTCTCCTTGT 

2041 AAGATGATAT AAAATATTGT TGCTCTGACA AAAATACATG TATTTCATTC TCGTATGGTG 
TTCTACTATA TTTTATAACA ACGAGACTGT TTTTATGTAC ATAAAGTAAG AGCATACCAC 

2101 CTAGAGTTAG ATTAATCTGC ATTTTAAAAA ACTGAATTGG AATAGAATTG GTAAGTTGCA 
GATCTCAATC TAATTAGACG TAAAATTTTT TGACTTAACC TTATCTTAAC CATTCAACGT 

2161 AAGACTTTTT GAAAATAATT AAATTATCAT ATCTTCCATT CCTGTTATTG GAGATGAAAA 
TTCTGAAAAA CTTTTATTAA TTTAATAGTA TAGAAGGTAA GGACAATAAC CTCTACTTTT 

2221 TAAAAAGCAA CTTATGAAAG TAGACATTCA GATCCAGCCA TTACTAACCT ATTCCTTTTT 
ATTTTTCGTT GAATACTTTC ATCTGTAAGT CTAGGTCGGT AATGATTGGA TAAGGAAAAA 

2281 TGGGGAAATC TGAGCCTAGC TCAGAAAAAC ATAAAGCACC TTGAAAAAGA CTTGGCAGCT 
ACCCCTTTAG ACTCGGATCG AGTCTTTTTG TATTTCGTGG AACTTTTTCT GAACCGTCGA 

2341 TCCTGATAAA GCGTGCTGTG CTGTGCAGTA GGAACACATC CTATTTATTG TGATGTTGTG 
AGGACTATTT CGCACGACAC GACACGTCAT CCTTGTGTAG GATAAATAAC ACTACAACAC 

2401 GTTTTATTAT CTTAAACTCT GTTCCATACA CTTGTATAAA TACATGGATA TTTTTATGTA 
CAAAATAATA GAATTTGAGA CAAGGTATGT GAACATATTT ATGTACCTAT AAAAATACAT 

2461 CAGAAGTATG TCTCT 
GTCTTCATAC AGAGA 
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7. 



New Sequence + Incyte ESTs 



1 ATTTGTTTAA ACCTTGGGAA ACTGGTTCAG GTCCAGGTTT TGCTTTGATC CTTTTCAAAA 
TAAACAAATT TGGAACCCTT TGACCAAGTC CAGGTCCAAA ACGAAACTAG GAAAAGTTTT 

61 ACTGGAGACA CAGAAGAGGG CTTCTAGGAA AAAGTTTTGG GATGGGATTA TGTGGAAACT 
TGACCTCTGT GTCTTCTCCC GAAGATCCTT TTTCAAAACC CTACCCTAAT ACACCTTTGA 

121 ACCCTGCGAT TCTCTGCTGC CAGAGCAGGC TCGGCGCTTC CACCCCAGTG CAGCCTTCCC 
TGGGACGCTA AGAGACGACG GTCTCGTCCG AGCCGCGAAG GTGGGGTCAC GTCGGAAGGG 

181 CTGGCGGTGG TGAAAGAGAC TCGGGAGTCG CTGCTTCCAA AGTGCCCGCC GTGAGTGAGC 
GACCGCCACC ACTTTCTCTG AGCCCTCAGC GACGAAGGTT TCACGGGCGG CACTCACTCG 

+2 Met SerLeuPhe GlyLeuLeu LeuLeuThrSer AlaLeuAl 

241 TCTCACCCCA GTCAGCCAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC 
AGAGTGGGGT CAGTCGGTTT ACTCGGAGAA GCCCGAAGAG GACGACTGTA GACGGGACCG 

+2 aGlyGlnArg GlnGlyThrGln AlaGluSer AsnLeuSer SerLysPheGln PheSerSe 

301 CGGCCAGAGA CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG 
GCCGGTCTCT GTCCCCTGAG TCCGCCTTAG GTTGGACTCA TCATTTAAGG TCAAAAGGTC 

+2 rAsnLysGlu GlnTyrGlyVal GlnAspPro GlnHisGlu ArgIleIleThr ValSerTh 

361 CAACAAGGAA CAGTACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC 
GTTGTTCCTT GTCATGCCTC ATGTTCTAGG AGTCGTACTC TCTTAATAAT GACACAGATG 

+2 rAsnGlySer IleHisSerPro ArgPhePro HisThrTyr ProArgAsnThr ValLeuVa 

421 TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT 
ATTACCTTCA TAAGTGTCGG GTTCCAAAGG AGTATGAATA GGTTCTTTAT GCCAGAACCA 

+2 lTrpArgLeu ValAlaValGlu GluAsnVal TrpIleGln LeuThrPheAsp GluArgPh 

481 ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT 
TACCTCTAAT CATCGTCATC TCCTTTTACA TACCTATGTT GAATGCAAAC TACTTTCTAA 

+2 eGlyLeuGlu AspProGluAsp AspIleCys LysTyrAsp PheValGluVal GluGluPr 

541 TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC 
ACCCGAACTT CTGGGTCTTC TACTGTATAC GTTCATACTA AAACATCTTC AACTCCTTGG 

+2 oSerAspGly ThrIleLeuGly ArgTrpCys GlySerGly ThrValProGly LysGlnIl 

601 CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT 
GTCACTACCT TGATATAATC CCGCGACCAC ACCAAGACCA TGACATGGTC CTTTTGTCTA 

+2 eSerLysGly AsnGlnIleArg IleArgPhe ValSerAsp GluTyrPhePro SerGluPr 

661 TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTCTGAACC 
AAGATTTCCT TTAGTTTAAT CCTATTCTAA ACATAGACTA CTTATAAAAG GAAGACTTGG 

+2 oGlyPheCys IleHisTyrAsn IleValMet ProGlnPhe ThrGluAlaVal SerProSe 

721 AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC 
TCCCAAGACG TAGGTGATGT TGTAACAGTA CGGTGTTAAG TGTCTTCGAC ACTCAGGAAG 

+2 rValLeuPro ProSerAlaLeu ProLeuAsp LeuLeuAsn AsnAlaIleThr AlaPheSe 

781 AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG 
TCACGATGGG GGAAGTCGAA ACGGTGACCT GGACGAATTA TTACGATATT GACGGAAATC 

+2 rThrLeuGlu AspLeuIleArg TyrLeuGlu ProGluArg TrpGlnLeuAsp LeuGluAs 

841 TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG ACTTAGAAGA 
ATGGAACCTT CTGGAATAAG CTATAGAACT TGGTCTCTCT ACCGTCAACC TGAATCTTCT 

+2 pLeuTyrArg ProThrTrpGln LeuLeuGly LysAlaPhe ValPheGlyArg LysSerAr 

901 TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG 
AGATATATCC GGTTGAACCG TTGAAGAACC GTTCCGAAAA CAAAAACCTT CTTTTAGGTC 

+2 gValValAsp LeuAsnLeuLeu ThrGluGlu ValArgLeu TyrSerCysThr ProArgAs 

961 AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA 
TCACCACCTA GACTTGGAAG ATTGTCTCCT CCATTCTAAT ATGTCGACGT GTGGAGCATT 

+2 nPheSerVal SerIleArgGlu GluLeuLys ArgThrAsp ThrIlePheTrp ProGlyCy 

1021 CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG 
GAAGAGTCAC AGGTATTCCC TTCTTGATTT CTCTTGGCTA TGGTAAAAGA CCGGTCCAAC 

+2 sLeuLeuVal LysArgCysGly GlyAsnCys AlaCysCys LeuHisAsnCys AsnGluCy 

1081 TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG 
AGAGGACCAA TTTGCGACAC CACCCTTGAC ACGGACAACA GAGGTGTTAA CGTTACTTAC 

+2 sGlnCysVal ProSerLysVal ThrLysLys TyrHisGlu ValLeuGlnLeu ArgProLy 

1141 TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAGACCAAA 
AGTTACACAG GGTTCGTTTC AATGATTTTT TATGGTGCTC CAGGAAGTCA ACTCTGGTTT 

+2 sThrGlyVal ArgGlyLeuHis LysSerLeu ThrAspVal AlaLeuGluHis HisGluGl 

1201 GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA 
CTGGCCACAG TCCCCTAACG TGTTTAGTGA GTGGCTGCAC CGGGACCTCG TGGTACTCCT 

+2 uCysAspCys ValCysArgGly SerThrGly Gly 

1261 GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCCG CATCACCACC AGCAGCTCTT 
CACACTGACA CACACGTCTC CCTCGTGTCC TCCTATCGGC GTAGTGGTGG TCGTCGAGAA 

1321 GCCCAGAGCT GTGCAGTGCA GTGGCTGATT CTATTAGAGA ACGTATGCGT TATCTCCATC 
CGGGTCTCGA CACGTCACGT CACCGACTAA GATAATCTCT TGCATACGCA ATAGAGGTAG 

1381 CTTAATCTCA GTTGTTTGCT TCAAGGACCT TTCATCTTCA GGATTTACAG TGCATTCTGA 
GAATTAGAGT CAACAAACGA AGTTCCTGGA AAGTAGAAGT CCTAAATGTC ACGTAAGACT 

1441 AAGAGGAGAC ATCAAACAGA ATTAGGAGTT GTGCAACAGC TCTTTTGAGA GGAGGCCTAA 
TTCTCCTCTG TAGTTTGTCT TAATCCTCAA CACGTTGTCG AGAAAACTCT CCTCCGGATT 

1501 AGGACAGGAG AAAAGGTCTT CAATCGTGGA AAGAAAATTA AATGTTGTAT TAAATAGATC 
TCCTGTCCTC TTTTCCAGAA GTTAGCACCT TTCTTTTAAT TTACAACATA ATTTATCTAG 

1561 ACCAGCTAGT TTCAGAGTTA CCATGTACGT ATTCCACTAG CTGGGTTCTG TATTTCAGTT 
TGGTCGATCA AAGTCTCAAT GGTACATGCA TAAGGTGATC GACCCAAGAC ATAAAGTCAA 

1621 CTTTCGATAC GGCTTAGGGT AATGTCAGTA CAGGAAAAAA ACTGTGCAAG TGAGCACCTG 
GAAAGCTATG CCGAATCCCA TTACAGTCAT GTCCTTTTTT TGACACGTTC ACTCGTGGAC 

1681 ATTCCGTTGC CTTGGCTTAA CTCTAAAGCT CCATGTCCTG GGCCTAAAAT CGTATAAAAT 
TAAGGCAACG GAACCGAATT GAGATTTCGA GGTACAGGAC CCGGATTTTA GCATATTTTA 

1741 CTGGATTTTT TTTTTTTTTT TTGCGCATAT TCACATATGT AAACCAGAAC ATTCTATGTA 
GACCTAAAAA AAAAAAAAAA AACGCGTATA AGTGTATACA TTTGGTCTTG TAAGATACAT 

1801 CTACAAACCT GGTTTTTAAA AAGGAACTAT GTTGCTATGA ATTAAACTTG TGTCATGCTG 
GATGTTTGGA CCAAAAATTT TTCCTTGATA CAACGATACT TAATTTGAAC ACAGTACGAC 

1861 ATAGGACAGA CTGGATTTTT CATATTTCTT ATTAAAATTT CTGCCATTTA GAAGAAGAGA 
TATCCTGTCT GACCTAAAAA GTATAAAGAA TAATTTTAAA GACGGTAAAT CTTCTTCTCT 

1921 ACTACATTCA TGGTTTGGAA GAGATAAACC TGAAAAGAAG AGTGGCCTTA TCTTCACTTT 
TGATGTAAGT ACCAAACCTT CTCTATTTGG ACTTTTCTTC TCACCGGAAT AGAAGTGAAA 

1981 ATCGATAAGT CAGTTTATTT GTTTCATTGT GTACATTTTT ATATTCTCCT TTTGACATTA 
TAGCTATTCA GTCAAATAAA CAAAGTAACA CATGTAAAAA TATAAGAGGA AAACTGTAAT 

2041 TAACTGTTGG CTTTTCTAAT CTTGTTAAAT ATATCTATTT TTACCAAAGG TATTTAATAT 
ATTGACAACC GAAAAGATTA GAACAATTTA TATAGATAAA AATGGTTTCC ATAAATTATA 

2101 TCTTTTTTAT GACAACTTAG ATCAACTATT TTTAGCTTGG TAAATTTTTC TAAACACAAT 
AGAAAAAATA CTGTTGAATC TAGTTGATAA AAATCGAACC ATTTAAAAAG ATTTGTGTTA 

2161 TGTTATAGCC AGAGGAACAA AGATGATATA AAATATTGTT GCTCTGACAA AAATACATGT 
ACAATATCGG TCTCCTTGTT TCTACTATAT TTTATAACAA CGAGACTGTT TTTATGTACA 

2221 ATTTCATTCT CGTATGGTGC TAGAGTTAGA TTAATCTGCA TTTTAAAAAA CTGAATTGGA 
TAAAGTAAGA GCATACCACG ATCTCAATCT AATTAGACGT AAAATTTTTT GACTTAACCT 

2281 ATAGAATTGG TAAGTTGCAA AGACTTTTTG AAAATAATTA AATTATCATA TCTTCCATTC 
TATCTTAACC ATTCAACGTT TCTGAAAAAC TTTTATTAAT TTAATAGTAT AGAAGGTAAG 

2341 CTGTTATTGG AGATGAAAAT AAAAAGCAAC TTATGAAAGT AGACATTCAG ATCCAGCCAT 
GACAATAACC TCTACTTTTA TTTTTCGTTG AATACTTTCA TCTGTAAGTC TAGGTCGGTA 

2401 TACTAACCTA TTCCTTTTTT GGGGAAATCT GAGCCTAGCT CAGAAAAACA TAAAGCACCT 
ATGATTGGAT AAGGAAAAAA CCCCTTTAGA CTCGGATCGA GTCTTTTTGT ATTTCGTGGA 

2461 TGAAAAAGAC TTGGCAGCTT CCTGATAAAG CGTGCTGTGC TGTGCAGTAG GAACACATCC 
ACTTTTTCTG AACCGTCGAA GGACTATTTC GCACGACACG ACACGTCATC CTTGTGTAGG 

2521 TATTTATTGT GATGTTGTGG TTTTATTATC TTAAACTCTG TTCCATACAC TTGTATAAAT 
ATAAATAACA CTACAACACC AAAATAATAG AATTTGAGAC AAGGTATGTG AACATATTTA 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 



12 / 54 



PCT/US99/30503 



2581 ACATGGATAT TTTTATGTAC AGAAGTATGT CTCTTAACCA GTTCACTTAT TGTACTCTGG 
TGTACCTATA AAAATACATG TCTTCATACA GAGAATTGGT CAAGTGAATA ACATGAGACC 

2641 CAATTTAAAA GAAAATCAGT AAAATATTTT GCTTGTAAAA TGCTTAATAT CGTGCCTAGG 
GTTAAATTTT CTTTTAGTCA TTTTATAAAA CGAACATTTT ACGAATTATA GCACGGATCC 

2701 TTATGTGGTG ACTATTTGAA TCAAAAATGT ATTGAATCAT CAAATAAAAG AATGTGGCTA 
AATACACCAC TGATAAACTT AGTTTTTACA TAACTTAGTA GTTTATTTTC TTACACCGAT 

2761 TTTTGGGGAG AAAATT 
AAAACCCCTC TTTTAA 



Additional oligonucleotides used for amplification of entire 
coding region 



5'-1 TTTGTTTAAACCTTGGGAAACTGG 
5'-2 GTCCAGGTTTTGCTTTGATCC 
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DNA Sequence Of Clones 4 & 7, Identical Clones Containing The 
Entire Open Reading Frame 

1 TTTGTTTAAA CCTTGGGAAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC TTTTCAAAAA 
AAACAAATTT GGAACCCTTT GACCAAGTCC AGGTCCAAAA CGAAACTAGG AAAAGTTTTT 

61 CTGGAGACAC AGAAGAGGGC TCTAGGAAAA AGTTTTGGAT GGGATTATGT GGAAACTACC 
GACCTCTGTG TCTTCTCCCG AGATCCTTTT TCAAAACCTA CCCTAATACA CCTTTGATGG 

121 CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAG CCTTCCCCTG 
GACGCTAAGA GACGACGGTC TCGTCCGAGC CGCGAAGGTG GGGTCACGTC GGAAGGGGAC 

181 GCGGTGGTGA AAGAGACTCG GGAGTCGCTG CTTCCAAAGT GCCCGCCGTG AGTGAGCTCT 
CGCCACCACT TTCTCTGAGC CCTCAGCGAC GAAGGTTTCA CGGGCGGCAC TCACTCGAGA 

+2 MetSer LeuPheGly LeuLeuLeu LeuThrSerAla LeuAlaGl 


241 CACCCCAGTC AGCCAAATGA GCCTCTTCGG GCTTCTCCTG CTGACATCTG CCCTGGCCGG 
GTGGGGTCAG TCGGTTTACT CGGAGAAGCC CGAAGAGGAC GACTGTAGAC GGGACCGGCC 

+2 yGlnArgGln GlyThrGlnAla GluSerAsn LeuSerSer LysPheGlnPhe SerSerAs 

301 CCAGAGACAG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT TTTCCAGCAA 
GGTCTCTGTC CCCTGAGTCC GCCTTAGGTT GGACTCATCA TTTAAGGTCA AAAGGTCGTT 

+2 nLysGluGln AsnGlyValGln AspProGln HisGluArg IleIleThrVal SerThrAs 

361 CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG TGTCTACTAA 
GTTCCTTGTC TTGCCTCATG TTCTAGGAGT CGTACTCTCT TAATAATGAC ACAGATGATT 

+2 nGlySerIle HisSerProArg PheProHis ThrTyrPro ArgAsnThrVal LeuValTr 

421 TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG TCTTGGTATG 
ACCTTCATAA GTGTCGGGTT CCAAAGGAGT ATGAATAGGT TCTTTATGCC AGAACCATAC 

+2 pArgLeuVal AlaValGluGlu AsnValTrp IleGlnLeu ThrPheAspGlu ArgPheGl 

481 GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG AAAGATTTGG 
CTCTAATCAT CGTCATCTCC TTTTACATAC CTATGTTGAA TGCAAACTAC TTTCTAAACC 

+2 yLeuGluAsp ProGluAspAsp IleCysLys TyrAspPhe ValGluValGlu GluProSe 

541 GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG AGGAACCCAG 
CGAACTTCTG GGTCTTCTAC TGTATACGTT CATACTAAAA CATCTTCAAC TCCTTGGGTC 

+2 rAspGlyThr IleLeuGlyArg TrpCysGly SerGlyThr ValProGlyLys GlnIleSe 

601 TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA AACAGATTTC 
ACTACCTTGA TATAATCCCG CGACCACACC AAGACCATGA CATGGTCCTT TTGTCTAAAG 

+2 rLysGlyAsn GlnIleArgIle ArgPheVal SerAspGlu TyrPheProSer GluProGl 

661 TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT CTGAACCAGG 
ATTTCCTTTA GTTTAATCCT ATTCTAAACA TAGACTACTT ATAAAAGGAA GACTTGGTCC 

+2 yPheCysIle HisTyrAsnIle ValMetPro GlnPheThr GluAlaValSer ProSerVa 



721 GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA GTCCTTCAGT 
CAAGACGTAG GTGATGTTGT AACAGTACGG TGTTAAGTGT CTTCGACACT CAGGAAGTCA 

+2 lLeuProPro SerAlaLeuPro LeuAspLeu LeuAsnAsn AlaIleThrAla PheSerTh 

781 GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG CCTTTAGTAC 
CGATGGGGGA AGTCGAAACG GTGACCTGGA CGAATTATTA CGATATTGAC GGAAATCATG 

+2 rLeuGluAsp LeuIleArgTyr LeuGluPro GluArgTrp GlnLeuAspLeu GluAspLe 



841 CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT TAGAAGATCT 
GAACCTTCTG GAATAAGCTA TAGAACTTGG TCTCTCTACC GTCAACCTGA ATCTTCTAGA 

+2 uTyrArgPro ThrTrpGlnLeu LeuGlyLys AlaPheVal PheGlyArgLys SerArgVa 



901 ATATAGGCCA ACTTGGCAAC TTCTTGGCAA GGCTTTTGTT TTTGGAAGAA AATCCAGAGT 
TATATCCGGT TGAACCGTTG AAGAACCGTT CCGAAAACAA AAACCTTCTT TTAGGTCTCA 

+2 lValAspLeu AsnLeuLeuThr GluGluVal ArgLeuTyr SerCysThrPro ArgAsnPh 



961 GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC CTCGTAACTT 
CCACCTAGAC TTGGAAGATT GTCTCCTCCA TTCTAATATG TCGACGTGTG GAGCATTGAA 

+2 eSerValSer IleArgGluGlu LeuLysArg ThrAspThr IlePheTrpPro GlyCysLe 



1021 CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC CAGGTTGTCT 
GAGTCACAGG TATTCCCTTC TTGATTTCTC TTGGCTATGG TAAAAGACCG GTCCAACAGA 

+ 2 uLeuValLys ArgCysGlyGly AsnCysAla CysCysLeu HisAsnCysAsn GluCysGl 



1081 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA ATGAATGTCA 
GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT TACTTACAGT 

+2 nCysValPro SerLysValThr LysLysTyr HisGluVal LeuGlnLeuArg ProLysTh 



1141 ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC 
TACACAGGGT TCGTTTCAAT GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG 

+ 2 rGlyValArg GlyLeuHisLys SerLeuThr AspValAla LeuGluHisHis GluGluCy 



1201 CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC ATGAGGAGTG 
GCCACAGTCC CCTAACGTGT TTAGTGAGTG GCTGCACCGG GACCTCGTGG TACTCCTCAC 

+ 2 sAspCysVal CysArgGlySer ThrGlyGly 



1261 TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC AGCTCTTGCC 
ACTGACACAC ACGTCTCCCT CGTGTCCTCC TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

1321 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT CTCCATCCTT 
GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA GAGGTAGGAA 

1381 AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG 
TTAGAGTCAA CAAACGAAGT TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC 

1441 AGGAGACATC AAACAGAATT AGGAGTTGTG CAA 
TCCTCTGTAG TTTGTCTTAA TCCTCAACAC GTT 
Predicted Full-length Polypeptide Sequence 

1 MSLFGLLLLT SALAGQRQGT QAESNLSSKF QFSSNKEQYG VQDPQHERII 

51 TVSTNGSIHS PRFPHTYPRN TVLVWRLVAV EENVWIQLTF DERFGLEDPE 

101 DDICKYDFVE VEEPSDGTIL GRWCGSGTVP GKQISKGNQI RIRFVSDEYF 

151 PSEPGFCIHY NIVMPQFTEA VSPSVLPPSA LPLDLLNNAI TAFSTLEDLI 

201 RYLEPERWQL DLEDLYRPTW QLLGKAFVFG RKSRVVDLNL LTEEVRLYSC 

251 TPRNFSVSIR EELKRTDTIF WPGCLLVKRC GGNCACCLHN CNECQCVPSK 

301 VTKKYHEVLQ LRPKTGVRGL HKSLTDVALE HHEECDCVCR GSTGG 
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VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



Alignment of VEGF-X with Other VEGFs 

* 20 * 40 * 



MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII : 50 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



60 



80 



100' 



TVS TNG S I H S PRF PHTYPRNTVLVWRLVAVEENVWI QLTFDERFGLEDPE : 100 



VEGF — HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD__HUMAN 
99012 6vegx 



120 



140 



MHLLGFFSVACSLLAAALLPGPREAPAAAA 

MYREWVWNV 

DDICKYDFVEV--EEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDE 



30 
10 
148 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
9 9012 6vegx 



160 



180 



200 
— MN 
--MP 



AFESGLDLSDAEPDAGEATAYASKDLEEQLRSVSSVDELMTVLYPEYWKM 
FMMLYVQLVQGSSNEHGPVKRSSQSTLERSEQQIRAASSLEELLRITHSE 
YFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNAITAFSTLED 



2 
2 

80 
60 
198 



220 



240 



VEGF_HUMAN : FLL SWVHWS IaALLL YLHHAKWS Q AAPMAEGGGQNHHEWKFMD - VYQ 

PLGF_HUMAN : VMRLFPCFLQLLAGLALPAVPPQQWALSAGNGSSEVEWPFQE-VW< 

VEGB_HUMAN : MSPLLRRLLLAALLQLAPAQAPVSQPDAPGHQRKWSWID 

VEGC_HUMAN : YKCQLRKGGWQHNREQANLNSRTEETIKFAAAHYNTEIIiKSIDl 

VEGD_HUMAN : DWKLWRCRLRLKS FT SMDSRS A SHRSTRF AATFYDI ETLKVXDEEWQ 

99012 6vegx : LIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTE. 



51 
51 
46 
130 
110 
248 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



260 

CHPIETLVDXFQ 
CRALERLVDWS 



CQPREWVPLTV | LMG 1 



SYPDEIEYIF 
3YPSEVEHMFS 
TVAKQL/ 



cmprevcidvgk j fgvatntf! 

' lgkstntf: 

sctprnfsvsirgelkrtdti 



280 



300 




96 
96 
91 
175 
155 
298 
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VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



TEESNITMQ 
VETANVTMi 
TGQHQVRMQ 
TSTSYtiSKT 

tstsyisk" 
skvtkkyheelqBr: 



320 

KPHQG QHIGEfflSFLQ 

rsgdr psyvewtfs 

ryps . — sqlgeBsle; 

FEflTVPLSQG PKPVTHSE 

FEQSVPLTSV PELVPvSkV, 

PKTGVRGLHKSLTD83AL: 





340 * 






SeSrPKKDRARQEK : 


141 




SEHRPLREKMKPER : 


141 


SQ 


jgEHRPKKKDSAVKP : 


135 


ITS 


SRHMSKLDVYRQVH* : 


222 




SKHLPTAPRHPYSI : 


202 


EE 


SdHVCRGSTGG : 


345 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



360 * 380 
KSVRGKGKGQKRKRKKSRYKSWSVP 



400 



DSPR 

S I IRRSLPATLPQCQAANKTC PTNYMWNNHICRCLAQEDFMFSSDAGDDS 
I RRS I Q I PEEDRC SH SKKLC P I DMLWDSNKCKCVLQEENPLAGT 



166 

139 
272 
246 



VEGF_HUMAN 
PLGF — HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
99012 6vegx 



420 



440 



TDGFHDICGPNKELDEETCQCVCRAGLRPASCGPHKELDRNSCQCVCKNK 

EDHSHLQEPALCGP 



322 
260 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



460 * 480 * 500 

CGPCSERRKHLFVQDPQTCKC-SCKNTDSRCKARQLELNER 

CGDAVPRR 

PLCPRCTQHHQRPDPRTCRCRCRRRSFLRCQGRGLELNPD 

LFPSQCGANREFDEOTCQCVCKRTCPRNQPLNPGKCACECTESPQKCLLK 
HMMFDEDRCECVCKTPCPKDLIQHPKNCSCFECKESLETCCQKHKLFHPD 



206 
149 
179 
372 
310 



VEGF_HUMAN 
PLGF_HUMAN 
VEGB_HUMAN 
VEGC_HUMAN 
VEGD_HUMAN 
990126vegx 



TCRCDKPRR- 



520 



540 



TCRCRKLRR 

GKKFHHQTCSCYRRPCTNRQKACEPGFSYSEEVCRCVPSYWKRPQMS- 
TCSCEDRCPFHTRPCASGKTACAKHCRFPKEKRAAQGPHSRKNP 



215 

188 
419 
354 
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FL_seq 

clone41 

clone20 



Variant Polypeptide Sequences 

* 40 



20 



MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERII 
MSLFGLIiLLTSALAGQRQGTOAESNLSSKFQFSSNKEQNGVQDPQHERII 
^.qiiFGL-LLLTSALAGOROGTOAESNLSSKFOFSSNKEOWGVQDPQHERII 



50 
50 
50 



FL_secj 

clone41 

clone20 



60 



80 



100 



TVSTNGSIHSPRFPHTYPRNTVL\7WRLVAVEENVWIQLTFDERFGLEDPE| 
TVSTHGSIHSPRFPHTYPRNTVLVl^LVAVEEIWWIQLTFDERFGLEDPE 
^ySTWnSTHRPPFPHTYPRMTVIiVWRLVAVEEWVWIQLTFDERFGLEDPE 



100 
100 
100 



FL^secj 

clone41 

clone20 



120 



140 



DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF| 
DDICKYDFVEVEEPSDGTILGRWCGSGTVPGKQISKGNQIRIRFVSDEYF 
nnTryvnP^7PAn?T?-p.gnr;TTT.r:Rwrc;SG f rvPGKOISKGNOIRIRFVSDEYF 



150 
150 
150 



FL__se<3 

clone41 

clone20 



160 
JGFCI 
ISNRGGKIl 

sfci: 




* 180 * 200 

ipqfteavspsvlppsalpldliinnaitafstledli 

|hts 

pqfteavspsvlppsalpldliinnaitafstledli 



200 
167 
200 



FL_seq 

clone41 

clone20 



* 220 * 240 * 

RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNLLTEEVRLYSC



RYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNLLTE- 



250 
243 



FL_seq

clone41 

clone20 



260 * 280 * 300 

TPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSK 



300 



FL_seq
clone41
clone20



* 320 * 340 

VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 

EVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG 



345 
282 
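The alignment above ends with residue counts of 345 for FL_seq and 282 for clone20, and in the last block the clone20 row resumes partway through the full-length row. A minimal Python sketch of that comparison follows, using only the two C-terminal rows as printed; the variable names are illustrative and are not part of the specification.

```python
# Final alignment block (residues 301-345 of FL_seq) as printed above.
fl_tail = "VTKKYHEVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG"
clone20_tail = "EVLQLRPKTGVRGLHKSLTDVALEHHEECDCVCRGSTGG"

# Offset within the full-length row at which the clone20 row resumes.
offset = fl_tail.find(clone20_tail)
missing_in_block = fl_tail[:offset]

# Overall length difference, taken from the residue counts at the end of the alignment.
fl_length, clone20_length = 345, 282
length_difference = fl_length - clone20_length

print(offset, missing_in_block, length_difference)
```

The sketch only restates what the alignment rows show; it does not attempt to locate the boundaries of the missing segment, which would require the complete clone20 sequence.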
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Primers for Expression of VEGF-X 

E.coli expression of domains

vegx-6 AATTGGATCCGAGAGTGGTGGATCTGAACC 

vegx-7 AATTGGATCCGGGAAGAAAATCCAGAGTGG 

vegx-8 GGTTGAATTCATTATTTTTTAGTAACTTTGCTTGGGACAC 

vegx-9 AATTGAATTCATTATCCTCCTGTGCTCCCTC

Baculovirus/insect cell expression of full-length protein

vegbac1 AATTGGATCCGGAGTCTCACCATCACCACCATCATGAATCCAACCTGAGTAGTAAATTCC

vegbac2 AATTGAATTCGCTATCCTCCTGTGCTCCCTCTGC 
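Each expression primer above starts with a four-base clamp followed by a six-base palindromic site. The Python sketch below scans the primers for those sites. The assignment of GGATCC to BamHI and GAATTC to EcoRI is standard, but the assumption that these are the cloning sites intended here is an inference from the sequences, not something stated in this listing.

```python
# Recognition sequences; their use as the intended cloning sites is an assumption.
SITES = {"BamHI": "GGATCC", "EcoRI": "GAATTC"}

# Primer sequences as printed in the listing above.
primers = {
    "vegx-6": "AATTGGATCCGAGAGTGGTGGATCTGAACC",
    "vegx-7": "AATTGGATCCGGGAAGAAAATCCAGAGTGG",
    "vegx-8": "GGTTGAATTCATTATTTTTTAGTAACTTTGCTTGGGACAC",
    "vegx-9": "AATTGAATTCATTATCCTCCTGTGCTCCCTC",
    "vegbac2": "AATTGAATTCGCTATCCTCCTGTGCTCCCTCTGC",
}

def find_sites(seq):
    """Return {enzyme: 0-based offset} for each recognition site found in seq."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}

site_map = {name: find_sites(seq) for name, seq in primers.items()}
for name, hits in site_map.items():
    print(name, hits)
```

On these sequences the forward primers carry only the GGATCC site and the reverse primers only the GAATTC site, each at offset 4.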
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>3993180H1 LUNGNON03 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAG^GGATA^C 
GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 
CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 
AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

T^CAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTG 

TCATCTTCATCATTTACAGT<K»TTCTGAAAGAG<»GACATC 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATC 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>2559870H1 ADRETUT01 INCYTE

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCA 

TGAG^A^TCTG^CTGT^TGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAG^ 

AGTGCAGTGGCTCATTCTATTAGAGAACGTATGCGTTATCTCC^ 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3979767H1 LDNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAG^ 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATT^ 

ACATCAAACAGAATTAGGJM3TTGTGCAACAGCTCTTTTGAGM 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAG 
GTTATCTCCATCCTTAATCTCAGTTGTTra 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAG 
AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>4825396H1 BLADDIT01 INCYTE • 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATT 

GCAAT^BAATGTC^ATGTGTC^CAAGCA 

AGGG^ATTGCACAAATCACTCACCGACGTGGCCCTGCAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 

aggatagccgcatcaccacca 

IgaaaatccagagtggtggatctgLccttctaacagaggagg^ 

GTC^ATA^G^GAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTG 

GTGC(^rGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAG 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>1302516H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGAT'TTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGA 
AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAA 

ACTAAAAAATACCACGAGGTCC 

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAANAGGAGA^ 
GAGGA^GCCTAAA^GACAG^AGAAAAGGTCTTCAATCGTGGAA 

GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGT 
TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 

>4713188H1 BRAIHCT01 INCYTE 

rAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTC 

CTCTTGCCC AG AGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGT ATGC GTT ATCTCCATCCTTAATCTC AGTTG - 
TTGCT 

>458823H1 KERANOT01 INCYTE _ 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACCTM 

GTTTGNTTC AAGG ACCTTTCATCTTCAGGATTT AC AGTGCATTCTGAAAGAGGAGACA •. « 

?IIeI™T^GAG^ 

ATAGATC 

>1303909H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCTTCTAACAGAGGAGGTAAGATTATAC 

aSt^Ic^tcgtaac^ctcagtxttccataagggaagaactaa^ 

CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAG 
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>2 73 9 2 1 1H1 OVARNOT0 9 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTC 

GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAG^ 

TATTCCACTAGCTGGGTTCTGTATTTCAGTTC 

GTGAGCACCTGAT 

>3325591H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTC 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTC 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>3 73 3 5 6 5H1 SMCCNOS01 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTC^ 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAATC 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTA 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT^ 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATC 

CTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATT^^ 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 

>4507477H1 OVARTDT01 INCYTE 

GGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTC 

GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATCT 
TAAAATCGTATAAAATCTGGA

>4163378H1 BRSTNOT3 2 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 

ATGTCCTGGGCCTAAAATCGTATA 
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>2054675H1 BEPINOT01 INCYTE 

AAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATT 

TCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTT 

TATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAA 

TCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTA 

>3993180H1 LUNGNON03 INCYTE 

CACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGNGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCC 

GCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCAT 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAG 

AATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCTAAAGGACAGGAGAANAGGTCTT 

>3510192H1 CONCNOT01 INCYTE 

TGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTT 

TCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAG 

GAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTT 

TCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTT 

>4164633H1 BRSTNOT32 INCYTE 

CTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTANTTATGACAA^ 

GTAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATG 
TATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCA 
AAGACTTTTTGANAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGGGGAGAAAAT 
>2559870H1 ADRETUTO 1 INCYTE 

CACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCA 

TGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGGGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGC 

AGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCA 

TCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGA 

>3817470H1 BONSTUT01 INCYTE 

TTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTT 

AATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTC 
ACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATA^ 
TAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTT
>3 979767H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAG 

ACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTG 

GAAAGAANATTAAATGTTGTATTAAATAGACACCAGCT 

>3980011H1 LUNGTUT08 INCYTE 

GGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGC 

GTTATCTCCATCCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACATGCATTCTGAAAGAGGAGA 

CATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGG 

AAAGAAAATTAAATGTTGTATTAAATAGATCACCA 

>4825396H1 BLADDIT01 INCYTE 

GAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATT 
GCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTC 
AGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGG 

AGGATAGCCGCATCACCACCA 

>3073703H1 BONEUNT0 1 INCYTE 

AGAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTCAGT 

GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTGGTGGGAACT 

GTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACCACGAGGTCCTTCAG 

TTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCA 

>862169H1 BRAITUT03 INCYTE 

AGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCA 
TTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTTTTTGAAAATAATTAAA 

CTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTT 
GGGGAAATCTGAGCCTAGC 

>4201385H1 BRAITUT29 INCYTE 

TTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGAC^ 

TAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATCT 
TCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCTTTGACATATAACTGTTGGCTTTT 
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CTAATCTGTTAAATATATCTATTTTTACCAAAGGTATTTAATAT 
>1302516H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTG^ 

AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTC 

CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCCACAATTGC^ 

ACTAAAAAATACCACGAGGTCC

>3684109H1 HEAANOT01 INCYTE 

ATTTCATCTTCAGGATTTACAGTGCATTCTGAAA^ 

GAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAANAAAATTAAATGTTGTATTAAATAG^ 
GTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTC 
TACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTT 
>2549720H1 LUNGTUT06 INCYTE 

TTAGCTTGGNAAATTTTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATA 

AATACATGTATTTCATTCTCGTATGGTCCTAGAGTTAGATTAATCTGCATTTTAA 

AAGTTGCAAAGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTC 
TATGANAGTAG 

>877279HX LUNG AST 01 INCYTE 

CTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAACA 

GATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGT 

TTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGGCTTTTTGAAAATAATTAAATTATC 

TGTTATTGGNGG LV * 
>4713188H1 BRAIHCT01 INCYTE 

CAAAGTTACTAAAAAATACCACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCG 

ACGTGGCCCTGGAGCACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAG 

CTCTTGCCCAGAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGT 
TTGCT 

>2171082H1 ENDCNOT03 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTT 
TATTCTCCTTTTGACATTATAACTC^ 

>875860H1 LUNGAST01 INCYTE 

CTGGATTTTTCATATTTCTTATTAAAATTC 

TGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTATATTCTCCT 
TTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTACCAAAGGTATTTAATATTCTTTTTTAT 

>706168H1 SYNORAT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGANCTATGTTGCTATGAAT 

TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTC 

^CATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCrrTATCTTCAOTTTATCGA 

>458823H1 KERANOT01 INCYTE 

ANGAGTTGCCCAGAGCTGTGCAGTGCAGTGGCTGAT^CTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTT 

CAACAGCTCTTTTGAGAGGAGGCCTAAAGGNCAGGAGAAAAGGTCTTCAATCGTGGAAAGAA^ 

ATAGATC 

>538436HI LNODNOT02 INCYTE 

AAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCT 
CATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAA^ 

TCCTGTTATTG G AG ATGAAAATAAAAAGCAACTTATGAAAGTAGACATTCAGATCC AGCC AT^ TAACCTAT 
>1303909H1 PLACNOT02 INCYTE 

AGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCT 

AGCTGCACACCTCGTAACTTCTCAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTC 
CCTGGTTAAACGCTGTGGTGGGAACTGTGCCTGTTXSTCTCCACAATTGCAATGAATGTC^ 
>2739211H1 OVAKNOT09 INCYTE 

GTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCAACAGCTCTTTTC 
GAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTATTAAATAGATCACCAGCTAGTTTCAGAG^ 

TATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGT 
GTGAGCACCTGAT 
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>2550343H1 LUNGTUT06 INCYTE 

TGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTT^^ 

GTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATT 

CAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCATTCTCGTATGGTGCTA 
>5321148H1 FIBPFEN06 INCYTE 

CACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGNCAAAAATACATGTATTTCATTCTCGTA 

TGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTC 

TAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAAAT^ 

CATTACTAAC 

>879495H1 THYRNOT02 INCYTE 

ATTTCATTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTA 
AGACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTC^ 

AGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCAC 
TGAAAAA 

>3 32559 1H1 PTHYNOT03 INCYTE 

TGCAACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATC 

AAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACG 

GCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACCCTAAAGCNCC 

ATGTCNNGGGCNAAAANCGAAAAAT 

>543890H1 OVARNOT0 2 INCYTE 

TTTCTAAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 

TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGNATAGAATTGGTA^ 

TTTGAAAATAATTAAATTATGATATCTTCCATTCCTGTTATTGGAGGATGGAAAATAAAAAGCAACTTATGGAAAGTAGG 

ACATTCAGATC 

> 3 7 3 3 5 6 5H1 SMCCNOS 0 1 INCYTE 

CCTTAATCTCAGTTGTTTGCTTCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGNAAGANGAGACATCAAACAG 

AATTAGGNGTTGTGCAAAAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTNCAATCGTGGAAAGNAAATT 

AAATGTTGTATNAAATNGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNCNGTATTCAGTCT 

TTCGGAACGGCTTAGGGTAATGTCAGTACAGGANAAAAACTGTGCAGTGAG 

>4641939H1 PROSTMT03 INCYTE 

GTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCCATGCTGATAGGACAGACTGGAT 

TTTNCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACATTC 

GAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATGTGTACATTT^ 

ATAACGTGGCTTT 

>2007780H1 TESTNOT03 INCYTE 

TTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTAT^ 
ATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCTAAAC 
AAAGATGATATAAAATATTGTTGCTCTGANAAAAATACATGTAT 
>3085331H1 HEAONOT03 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCrGGTTTTTAAAAAGGAACTATTT 
AAACTTGTGTCGTGCTGATAGGACAGACTGGNTTTTTCATATTTCTTATTANAATTTCTGCCATT 
CATTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTATTTCACTTTATCGATAAGTCAGT 
>3414043H1 PTHYNOT04 INCYTE 

GCTCATATTCACATATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAAT 
TAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 
TACATTCATGGTTTGGAAGAGATAAACCTGAAA 
>3705963H1 PENCNOT07 INCYTE 

ANACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAA 
TCTGGAnnnnnniuixmxinnnnnnnGCTCATAT^ 

AAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGGATTTTTCATATTTCTTAT 
TTCTGCCATTAGAAGAAGAGAACTACNTTGANGGTTTGGAAGAGATAACCCTGAAAAGANGGG 

>5137051H1 OVARDIT04 INCYTE 

AAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAAGACTNTTTGA 

TATTGGAGATGAANATAAAAAGCAACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTT^ 

AAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGC 

GTAGGAACACATCCTATTTATTGTGATGNTGTGGTTTATTAT 

>3554223H1 SYNONOT01 INCYTE 

ATTAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTC 

ACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGGCT^ 
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ctccatgtcctggcxxtaaaatcgtataaaatctggatt^^ 

ACATTCTATGTACNACAAACCTGGTTTTTAAAAAGGAAC 
>4507477H1 OVARTDTOl INCYTE 

cxx:tagtttcagagttaccatgtacgtattc^ 

GTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGf^C 
TAAAATCGTATAAAATCTGGA 

>1955646H1 CONNNOT01 INCYTE 

TGGTAAGTTGCAAAGACrTTTTGAAAATAATTAAATTATCATATCTTC 

AACTTATGAAAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATC 

ACATAAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTATATCCTAAACC 

>4163378H1 BRSTNOT32 INCYTE 

AATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGNTCTGTATTTCAGTTCCTT^ 

GCTTAGGGTAATGTCAGTACAGGAAAAAAGCTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
ATGTCCTGGGCCTAAAATCGTATA 

> 5 0 9 5 14 1H1 EPIMNONO 5 INCYTE 

AGATAAACCTGAAAAGAAGAGTGGCCTT^^ 

TATTCTCCTTTTGACATTATAACTGNTGGCTTTTCTAANCNTGTTAAATATATCTATTTTTACCAAAGGTAT^ 
CTTT 

>943826H1 ADRENOT03 INCYTE 

TATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTC 
AATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATG 
>3451273H1 UTRSNONO 3 INCYTE 

TTTTTTNTTTTGCTCATATTC^ 

TTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACAGACTGG^ 
AAGA 

>1402278H1 LATRTUT02 INCYTE 

GTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGCCTAAA 
ATCGTATAAAATCTGGAnnnnnnrmnnrmrmnnnnGCTCATATTCACATAT^ 

CCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCGTGCTGATAGGACA 
CTTA 

>43 6I191H1 SKIRNOT01 INCYTE 

GCAAAGACTTTTTGANAATNATTAANTTATCATATC 

AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCNCAGAAAAACATAAAGC 

ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCNATTTATTGTGNTGTN 
GNGGTTTTATGATC 

>1307017H1 PLACNOT02 INCYTE 

TGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCCATGTCCTGGGC 
CTAAAATCGTATAAAATCTGGAJinimnnrmnnnnnnnnnnGCTCATATTCACA 

ACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGTGTCATGCTGATAGGACAG 
TAT 

>5032225H1 HEARFET03 INCYTE 

AATTATCATATCTTCCATTCCTGTTATTGGAGATGNAAATAAAAAGCAACTTATGAAAGTAGACATTCA 
TA CTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTG 

^f mGCGTGCmCTGTGCAGmM ^ 
>3732621H1 SMCCNOS01 INCYTE 

ANAGATGATATAAAANATTGTTGCTCTGACAANNATACATGTATTTCATTCT 
CNTTTTAAAAAACTGANTTGGAATAGANTTGGTAAGTTGCAAAGNCNTTTGAA 
>3530274H1 BLAONOTO 9 INCYTE 

TTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGAAAGTAGACAT^ 
CCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAA^ 

TGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTAAACTCTGTTCCATACACTTG 

TATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCA 

>3530249H1 BLAONOTO 9 INCYTE 

CTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGANAGTAGACATTCAGATC 
TCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACATAAAGCACCTTGAAAAAGA 
GTGCTGTGCTGTGCAGTAGGAACACATCCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTC 
TGTATAAATACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACCTC 
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^72^ 




VEGFE1   AAAATGTATGGATACAACTTAC    22
VEGFE2   GTTTGATGAAAGATTTGGGCTTG   23
VEGFE3   TTTCTAAAGGAAATCAAATTAG    22
VEGFE4   GATAAGATTTGTATCTGATG      20
VEGFE5   GATGTCTCCTCTTTCAG         17
VEGFE6   GCACAACTCCTAATTCTG        18
VEGFE7   AGCACCTGATTCCGTTGC        19
VEGFE8   TAGTACATAGAATGTTCTGG      20
VEGFE9   AAGAGACATACTTCTGTAC       19
VEGFE10  CCAGGTACAATAAGTGAACTG     21
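The number printed with each VEGFE primer appears to be its length in nucleotides. The Python check below tests that assumption against the sequences as they can be read from this listing. VEGFE2 is taken as GTTTGATGAAAGATTTGGGCTTG, reading its garbled leading characters against the construct sequences elsewhere in the figures. VEGFE7 reads one base shorter than its printed length, so it is flagged rather than corrected.

```python
# (name, sequence as read from the listing, length printed in the listing)
primers = [
    ("VEGFE1", "AAAATGTATGGATACAACTTAC", 22),
    ("VEGFE2", "GTTTGATGAAAGATTTGGGCTTG", 23),  # leading bases are a reading of a garbled entry
    ("VEGFE3", "TTTCTAAAGGAAATCAAATTAG", 22),
    ("VEGFE4", "GATAAGATTTGTATCTGATG", 20),
    ("VEGFE5", "GATGTCTCCTCTTTCAG", 17),
    ("VEGFE6", "GCACAACTCCTAATTCTG", 18),
    ("VEGFE7", "AGCACCTGATTCCGTTGC", 19),
    ("VEGFE8", "TAGTACATAGAATGTTCTGG", 20),
    ("VEGFE9", "AAGAGACATACTTCTGTAC", 19),
    ("VEGFE10", "CCAGGTACAATAAGTGAACTG", 21),
]

def gc_fraction(seq):
    """Fraction of G and C bases in seq."""
    return (seq.count("G") + seq.count("C")) / len(seq)

# Entries whose sequence length as read differs from the printed number.
mismatches = [name for name, seq, printed in primers if len(seq) != printed]

for name, seq, printed in primers:
    print(f"{name:8s} len={len(seq):2d} printed={printed:2d} GC={gc_fraction(seq):.2f}")
print("length mismatches:", mismatches)
```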
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NLLT EEV RLY 



+ 3 M N I F L L 



] 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG
AACCTTCTAA CAGAGGAGGT AAGATTATAC 

TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC
TTGGAAGATT GTCTCCTCCA TTCTAATATG 

+ 3SCTP RNF SVS IREE LKR 
TDTIFWPGCL 



81 AGCTGCACAC CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG 
AACCGATACC ATTTTCTGGC CAGGTTGTCT 

TCGACGTGTG GAGCATTGAA GAGTCACAGG TATTCCCTTC TTGATTTCTC 
TTGGCTATGG TAAAAGACCG GTCCAACAGA 
-2 



+3 LVK RCGG NCA CCL HNCN
ECQCVPSKV



161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA

GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT 
TACTTACAGT TACACAGGGT TCGTTTCAAT 

-2 



+ 3TKKY HEV LQLR P K T GVR 
GLHKSLTDVA 



DCTNHSPTWP VSG 



241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG
GGATTGCACA AATCACTCAC CGACGTGGCC 

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC 
CCTAACGTGT TTAGTGAGTG GCTGCACCGG 

-2 

[ 



+ 3LEHH EEC DCV CRGS TGG 
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> 

+ 2 VQREHRR 
IAAS PPA ALA 

] 



+ 1 WST MRSV T V C AEG AQED 
SRI TTS SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG 
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QS .CA VQW L I L LENV CVI 
SIL NLSC L L Q 



+ 1 P E L C SAV ADSI RER MRY 

LHP 



401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 GPF IFRI YSA F 



481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG 
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC 
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT 
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 

CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG

801 CTAAAATCGT ATAAAATCTG GA 
GATTTTAGCA TATTTTAGAC CT 
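In the listing above each numbered row is one strand and the row beneath it is the base-paired complement, written in the same left-to-right order. The Python sketch below expresses that relationship and checks it against the final row (positions 801 to 822). The function names are illustrative.

```python
# Base-pairing map; N (unresolved base) is left as N.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")

def complement(seq):
    """Base-pair complement in the same left-to-right order as the input."""
    return seq.translate(COMPLEMENT)

def reverse_complement(seq):
    """Opposite strand read 5'->3', as it would be written on its own."""
    return complement(seq)[::-1]

top = "CTAAAATCGTATAAAATCTGGA"      # row numbered 801
bottom = "GATTTTAGCATATTTTAGACCT"   # row printed beneath it

print(complement(top) == bottom)
print(reverse_complement(top))
```

The same check can be run row by row to spot character errors in either strand, since a mismatch between a row and the complement of its partner marks a position that was misread.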
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N L L +3 T E E V R L Y M N I F L L 



] 



1 AGGAAATCAA ATTAGGATAA GATTTGTATC TGATGAATAT TTTCCTTCTG
AACCTTCTAA CAGAGGAGGT AAGATTATAC

TCCTTTAGTT TAATCCTATT CTAAACATAG ACTACTTATA AAAGGAAGAC
TTGGAAGATT GTCTCCTCCA TTCTAATATG

+ 3SCTP RNF SVS IREE LKR
TDTIFWPGCL

~ ] 

-2 



+3 LVK RCGG NCA CCL HNCN
ECQCVPSKV

161 CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA
ATGAATGTCA ATGTGTCCCA AGCAAAGTTA

GGACCAATTT GCGACACCAC CCTTGACACG GACAACAGAG GTGTTAACGT
TACTTACAGT TACACAGGGT TCGTTTCAAT
-2 



+ 3TKKY HEV LQLR PKT GVR
GLHKSLTDVA



+1 

DCTNHSPTWP VSG 



241 CTAAAAAATA CCACGAGGTC CTTCAGTTGA GACCAAAGAC CGGTGTCAGG
GGATTGCACA AATCACTCAC CGACGTGGCC

GATTTTTTAT GGTGCTCCAG GAAGTCAACT CTGGTTTCTG GCCACAGTCC
CCTAACGTGT TTAGTGAGTG GCTGCACCGG
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-2 
-[ 



+ 3LEHH EEC DCV CRGS TGG 



> 



+ 2 VQREHRR 
IAAS PPA ALA 

] 



+ 1 WST MRSV TVC AEG AQED 
SRI TTS SSC 



321 CTGGAGCACC ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG
ATAGCCGCAT CACCACCAGC AGCTCTTGCC 

GACCTCGTGG TACTCCTCAC ACTGACACAC ACGTCTCCCT CGTGTCCTCC 
TATCGGCGTA GTGGTGGTCG TCGAGAACGG 

+ 2QSCA VQW LIL LENV CVI 
S I L NLSC LLQ 



+ 1PELC SAV ADSI RER MRY 

LHP 
> 

401 CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 
CTCCATCCTT AATCTCAGTT GTTTGCTTCA 

GTCTCGACAC GTCACGTCAC CGACTAAGAT AATCTCTTGC ATACGCAATA
GAGGTAGGAA TTAGAGTCAA CAAACGAAGT 

+ 2 G P F IFRI YSA F 
> 

481 AGGACCTTTC ATCTTCAGGA TTTACAGTGC ATTCTGAAAG AGGAGACATC 
AAACAGAATT AGGAGTTGTG CAACAGCTCT 

TCCTGGAAAG TAGAAGTCCT AAATGTCACG TAAGACTTTC TCCTCTGTAG
TTTGTCTTAA TCCTCAACAC GTTGTCGAGA 

561 TTTGAGAGGA GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG 
AAAATTAAAT GTTGTATTAA ATAGATCACC 

AAACTCTCCT CCGGATTTCC TGTCCTCTTT TCCAGAAGTT AGCACCTTTC
TTTTAATTTA CAACATAATT TATCTAGTGG 

641 AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT
TTCAGTTCTT TCGATACGGC TTAGGGTAAT 

TCGATCAAAG TCTCAATGGT ACATGCATAA GGTGATCGAC CCAAGACATA 
AAGTCAAGAA AGCTATGCCG AATCCCATTA 

721 GTCAGTACAG GAAAAAAACT GTGCAAGTGA GCACCTGATT CCGTTGCCTT 
GGCTTAACTC TAAAGCTCCA TGTCCTGGGC 
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CAGTCATGTC CTTTTTTTGA CACGTTCACT CGTGGACTAA GGCAACGGAA 
CCGAATTGAG ATTTCGAGGT ACAGGACCCG

801 CTAAAATCGT ATAAAATCTG GATTTTTTTN TTTTTTTTTG CGCATATTCA
CATATGTAAA CCAGAACATT CTATGTACTA

GATTTTAGCA TATTTTAGAC CTAAAAAAAN AAAAAAAAAC GCGTATAAGT
GTATACATTT GGTCTTGTAA GATACATGAT

881 CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT
CGTGCTGATA GGACAGACTG GATTTTTCAT

GTTTGGACCA AAAATTTTTC CTTGATACAA CGATACTTAA TTTGAACACA
GCACGACTAT CCTGTCTGAC CTAAAAAGTA

961 ATTTCTTATT AAAATTTCTG CCATTTAGAA GAAGAGAACT ACATTCATGG
TTTGGAAGAG ATAAACCTGA AAAGAAGAGT

TAAAGAATAA TTTTAAAGAC GGTAAATCTT CTTCTCTTGA TGTAAGTACC
AAACCTTCTC TATTTGGACT TTTCTTCTCA
-3

1041 GGCCTTATCT TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA
CATTTTTATA TTCTCCTTTT GACATTATAA

CCGGAATAGA AGTGAAATAG CTATTCAGTC AAATAAACAA AGTAACACAT
GTAAAAATAT AAGAGGAAAA CTGTAATATT

-3

1121 CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT
TTAATATTCT TTTTTATGAC AACTTAGATC

GACAACCGAA AAGATTAGAA CAATTTATAT AGATAAAAAT GGTTTCCATA
AATTATAAGA AAAAATACTG TTGAATCTAG

1201 AACTATTTTT AGCTTGGTAA ATTTTTCTAA ACACAATTGT TATAGCCAGA
GGAACAAAGA TGATATAAAA TATTGTTGCT

TTGATAAAAA TCGAACCATT TAAAAAGATT TGTGTTAACA ATATCGGTCT
CCTTGTTTCT ACTATATTTT ATAACAACGA

1281 CTGACAAAAA TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA
ATCTGCATTT TAAAAAACTG AATTGGAATA

GACTGTTTTT ATGTACATAA AGTAAGAGCA TACCACGATC TCAATCTAAT
TAGACGTAAA ATTTTTTGAC TTAACCTTAT

1361 GAATTGGTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT
TCCATTCCTG TTATTGGAGA TGAAAATAAA

CTTAACCATT CAACGTTTCT GAAAAACTTT TATTAATTTA ATAGTATAGA
AGGTAAGGAC AATAACCTCT ACTTTTATTT

1441 AAGCAACTTA TGAAAGTAGA CATTCAGATC CAGCCATTAC TAACCTATTC
CTTTTTTGGG GAAATCTGAG CCTAGCTCAG

TTCGTTGAAT ACTTTCATCT GTAAGTCTAG GTCGGTAATG ATTGGATAAG
GAAAAAACCC CTTTAGACTC GGATCGAGTC
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1521 AAAAACATAA AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT 
GCTGTGCTGT GCAGTAGGAA CACATCCTAT 

TTTTTGTATT TCGTGGAACT TTTTCTGAAC CGTCGAAGGA CTATTTCGCA
CGACACGACA CGTCATCCTT GTGTAGGATA

1601 TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTGTTC CATACACTTG 
TATAAATACA TGGATATTTT TATGTACAGA 

AATAACACTA CAACACCAAA ATAATAGAAT TTGAGACAAG GTATGTGAAC 
ATATTTATGT ACCTATAAAA ATACATGTCT



1681 AGTATGTCTC TTAACCAGTT CACTTATTGT ACCTGG 
TCATACAGAG AATTGGTCAA GTGAATAACA TGGACC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37641 PCT/US99/30503 

33/54 

DNA and polypeptide sequence used for mammalian cell expression

+ 1 M S L F G L L L L T S A L A G Q R
1 GGATCCAAAA TGAGCCTCTT CGGGCTTCTC CTGCTGACAT CTGCCCTGGC CGGCCAGAGA

+ 1 Q G T Q A E S NLS S K F Q FSS N K E
61 CAGGGGACTC AGGCGGAATC CAACCTGAGT AGTAAATTCC AGTTTTCCAG CAACAAGGAA

+ 1 Q N G V QDP QHE RIIT VST NGS
121 CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA CTGTGTCTAC TAATGGAAGT

+ 1IHSP R F P HTY PRNT V L V W R L
181 ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA CGGTCTTGGT ATGGAGATTA

+ 1 V A V E ENV WIQ LTFD ERF GLE
241 GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG ATGAAAGATT TGGGCTTGAA

+ 1 D P E D DIC K Y D FVEV EEP SDG
301 GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG TTGAGGAACC CAGTGATGGA

+ 1TILG RWC GSG TVPG KQI SKG
361 ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG GAAAACAGAT TTCTAAAGGA

+ 1NQIR I R F VSD E Y F P SEP GFC
421 AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC CTTCTGAACC AGGGTTCTGC

+ 1IHYN IVM P Q F TEAV SPS VLP
481 ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG TGAGTCCTTC AGTGCTACCC

+ 1PSAL PLD LLN NAIT AFS TLE
541 CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA CTGCCTTTAG TACCTTGGAA

+ 1DLIR YLE PER WQLD LED LYR
601 GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG ACTTAGAAGA TCTATATAGG

+ 1 P T W Q LLG KAF VFGR K S R VVD
661 CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA GAAAATCCAG AGTGGTGGAT

+ 1LNLL TEE VRL YSCT PRN FSV
721 CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA CACCTCGTAA CTTCTCAGTG

+ 1SIRE ELK RTD TIFW PGC LLV
781 TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT GGCCAGGTTG TCTCCTGGTT

+ 1KRCG GNC ACC LHNC NEC QCV
841 AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT GCAATGAATG TCAATGTGTC

+ 1 P S K V TKK YHE V L Q L R P K T G V
901 CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT TGAGACCAAA GACCGGTGTC

+ 1RGLH KSL TDV ALEH HEE CDC
961 AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC ACCATGAGGA GTGTGACTGT



1021 GTGTGCAGAG GGAGCACAGG AGGATCTAGA GGGCCCT77G AAGG7AAGCC TATCCC7AAC 

+ 1 P L L G L D S T R T G H H H H H H *

1081 CCTCTCCTCG GTCTCGATTC TACGCGTACC GGTCATCATC ACCATCACCA TTGA
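The residue rows in this figure are the +1 frame translation of the DNA rows beneath them. The Python sketch below applies the standard genetic code to the first row of the mammalian construct, starting at the ATG that follows the leading GGATCCAAA. The helper name `translate` and the use of "X" for unreadable codons are illustrative choices, not part of the specification.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna, start=0):
    """Translate dna from offset start; codons with non-ACGT characters become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(start, len(dna) - 2, 3)
    )

# Row numbered 1 of the mammalian expression construct, with the group spacing removed.
row1 = "GGATCCAAAATGAGCCTCTTCGGGCTTCTCCTGCTGACATCTGCCCTGGCCGGCCAGAGA"
start = row1.find("ATG")
protein = translate(row1, start)
print(start, protein)
```

Running the same function over a whole construct gives a quick way to cross-check the printed residue rows against the DNA rows, which is useful where a scan has confused digits with letters.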
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/£/%P DNA and polypeptide sequence used for baculovirus/insect cell expression 

1 GAATTCAAAG GCCTGTATTT TACTGTTTTC GTAACAGTTT TGTAATAAAA AAACCTATAA 

+ 3 M K F L V N V A L V F M V V Y I S Y I
61 ATATGAAATT CTTAGTCAAC GTTGCCCTTG TTTTTATGGT CGTATACATT TCTTACATCT

+ 3 Y A DP ESH HHHH H ES NLS SKF
121 ATGCGGATCC GGAGTCTCAC CATCACCACC ATCATGAATC CAACCTGAGT AGTAAATTCC

+ 3QF SS NKE QNGV QDP QHE R I I
181 AGTTTTCCAG CAACAAGGAA CAGAACGGAG TACAAGATCC TCAGCATGAG AGAATTATTA

+ 3TVST NGS IHSP RFP HTY PRN
241 CTGTGTCTAC TAATGGAAGT ATTCACAGCC CAAGGTTTCC TCATACTTAT CCAAGAAATA

+ 3TVLV WRL VAVE ENV WIQ L T F
301 CGGTCTTGGT ATGGAGATTA GTAGCAGTAG AGGAAAATGT ATGGATACAA CTTACGTTTG

+ 3DERF GLE DPED DIC KYD FVE
361 ATGAAAGATT TGGGCTTGAA GACCCAGAAG ATGACATATG CAAGTATGAT TTTGTAGAAG

+ 3 VEEP SDG TILG RWC GSG TVP
421 TTGAGGAACC CAGTGATGGA ACTATATTAG GGCGCTGGTG TGGTTCTGGT ACTGTACCAG

+ 3GKQI SKG NQIR IRF VSD EYF
481 GAAAACAGAT TTCTAAAGGA AATCAAATTA GGATAAGATT TGTATCTGAT GAATATTTTC

+ 3PSEP GFC IHYN IVM PQF TEA
541 CTTCTGAACC AGGGTTCTGC ATCCACTACA ACATTGTCAT GCCACAATTC ACAGAAGCTG

+ 3 V S P S V L P PSAL PLD LLN N A I
601 TGAGTCCTTC AGTGCTACCC CCTTCAGCTT TGCCACTGGA CCTGCTTAAT AATGCTATAA

+ 3TAFS T L E DLIR YLE PER WQL
661 CTGCCTTTAG TACCTTGGAA GACCTTATTC GATATCTTGA ACCAGAGAGA TGGCAGTTGG

+ 3DLED LYR PTWQ LLG KAF VFG
721 ACTTAGAAGA TCTATATAGG CCAACTTGGC AACTTCTTGG CAAGGCTTTT GTTTTTGGAA

+ 3RKSR VVD LNLL TEE VRL YSC
781 GAAAATCCAG AGTGGTGGAT CTGAACCTTC TAACAGAGGA GGTAAGATTA TACAGCTGCA

+ 3TPRN FSV SIRE ELK RTD TIF
841 CACCTCGTAA CTTCTCAGTG TCCATAAGGG AAGAACTAAA GAGAACCGAT ACCATTTTCT

+ 3WPGC LLV KRCG GNC ACC LHN
901 GGCCAGGTTG TCTCCTGGTT AAACGCTGTG GTGGGAACTG TGCCTGTTGT CTCCACAATT

+ 3CNE C QCV PSKV TKK YHE V L Q
961 GCAATGAATG TCAATGTGTC CCAAGCAAAG TTACTAAAAA ATACCACGAG GTCCTTCAGT

+ 3LRPK TGV RGLH KSL T D V ALE
1021 TGAGACCAAA GACCGGTGTC AGGGGATTGC ACAAATCACT CACCGACGTG GCCCTGGAGC

+ 3HHEE CDC VCRG STG G
1081 ACCATGAGGA GTGTGACTGT GTGTGCAGAG GGAGCACAGG AGGATAGCTC TAGA
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' xc *- DNA and polypeptide sequence used for E*coli expression 



♦ 3 



-2 S NKTNN MWM NNM LflT 



1 CGCAGACTAA TTCGAGCTCG AACAACAACA ACAATAACAA TAACAACAAC CTCGGGATCG 

+ 3 E G R I S E F ESNL SSK FQF SSN

61 AGGGAAGGAT TTCAGAATTC GAATCCAACC TGAGTAGTAA ATTCCAGTTT TCCAGCAACA

+ 3KEQN GVQ DPQH ERI ITV STN
121 AGGAACAGAA CGGAGTACAA GATCCTCAGC ATGAGAGAAT TATTACTGTG TCTACTAATG

+ 3GSIH SPR FPHT YPR NTV LVW
181 GAAGTATTCA CAGCCCAAGG TTTCCTCATA CTTATCCAAG AAATACGGTC TTGGTATGGA

+ 3 R L V A VEE N V W I QLT FDE R F G
241 GATTAGTAGC AGTAGAGGAA AATGTATGGA TACAACTTAC GTTTGATGAA AGATTTGGGC

+ 3LEDP EDD ICKY DFV EVE EPS
301 TTGAAGACCC AGAAGATGAC ATATGCAAGT ATGATTTTGT AGAAGTTGAG GAACCCAGTG

+ 3DGTI LGR WCGS GTV PGK QIS
361 ATGGAACTAT ATTAGGGCGC TGGTGTGGTT CTGGTACTGT ACCAGGAAAA CAGATTTCTA

+ 3KGNQ IRI RFVS D E Y FPS EPG
421 AAGGAAATCA AATTAGGATA AGATTTGTAT CTGATGAATA TTTTCCTTCT GAACCAGGGT

+ 3 F C I H Y N I VMPQ FTE AVS PSV
481 TCTGCATCCA CTACAACATT GTCATGCCAC AATTCACAGA AGCTGTGAGT CCTTCAGTGC

+ 3 L P P S ALP LDLL N N A ITA FST
541 TACCCCCTTC AGCTTTGCCA CTGGACCTGC TTAATAATGC TATAACTGCC TTTAGTACCT

+ 3 L E D L IRY LEPE RWQ LDL EDL
601 TGGAAGACCT TATTCGATAT CTTGAACCAG AGAGATGGCA GTTGGACTTA GAAGATCTAT

+ 3YRPT W Q L LGKA FVF GRK SRV
661 ATAGGCCAAC TTGGCAACTT CTTGGCAAGG CTTTTGTTTT TGGAAGAAAA TCCAGAGTGG

+ 3VDLN LLT EEVR LYS CTP RNF
721 TGGATCTGAA CCTTCTAACA GAGGAGGTAA GATTATACAG CTGCACACCT CGTAACTTCT

+ 3SVSI REE LKRT DTI FWP GCL
781 CAGTGTCCAT AAGGGAAGAA CTAAAGAGAA CCGATACCAT TTTCTGGCCA GGTTGTCTCC

+ 3LVKR CGG NCAC CLH NCN ECQ
841 TGGTTAAACG CTGTGGTGGG AACTGTGCCT GTTGTCTCCA CAATTGCAAT GAATGTCAAT

+ 3CVPS KVT KKYH EVL QLR PKT
901 GTGTCCCAAG CAAAGTTACT AAAAAATACC ACGAGGTCCT TCAGTTGAGA CCAAAGACCG

+ 3 G V R G LHK SLTD VAL EHH EEC
961 GTGTCAGGGG ATTGCACAAA TCACTCACCG ACGTGGCCCT GGAGCACCAT GAGGAGTGTG

+ 3 D C V C R G S T G G H H H H H H *
1021 ACTGTGTGTG CAGAGGGAGC ACAGGAGGAC ATCATCACCA TCACCATTGA TCTAGAGTCG
1081 ACCTGCAGGC AAGCTT
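The E.coli construct above ends with a run of histidine codons immediately before a TGA stop codon. One way to locate such a tag at the DNA level is a regular expression over CAT/CAC codons, sketched here in Python on the row numbered 1021, with the one garbled character in that row read as G. The six-codon threshold and the names `HIS_RUN` and `row_1021` are assumptions chosen for illustration.

```python
import re

# Six or more consecutive histidine codons (CAT or CAC).
# The pattern is not frame-aware: it finds the run wherever it starts in the string.
HIS_RUN = re.compile(r"(?:CA[TC]){6,}")

# Row numbered 1021 of the E.coli expression construct, with the group spacing removed.
row_1021 = "ACTGTGTGTGCAGAGGGAGCACAGGAGGACATCATCACCATCACCATTGATCTAGAGTCG"

m = HIS_RUN.search(row_1021)
his_codons = (m.end() - m.start()) // 3
next_codon = row_1021[m.end():m.end() + 3]
print(m.start(), m.end(), his_codons, next_codon)
```

Because the pattern ignores reading frame, a hit should be confirmed against the translated frame of the construct before it is treated as a tag.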
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(A) Mammalian ceil expression 



medium 



purified 
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reduced monomer 





putative nonreduced dimer 



(B) E.coli expression 

M + 




putative dimer 



monomer 
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DNA and polypeptide sequence used for E.coli expression of the PDGF-like domain 

+3 M R G S H H H H H H G M A S M

1 AAGGAGATAT ACATATGCGG GGTTCTCATC ATCATCATCA TCATGGTATG GCTAGCATGA

+3 T G G Q Q M G R D L Y D D D D K D P G R
61 CTGGTGGACA GCAAATGGGT CGGGATCTGT ACGACGATGA CGATAAGGAT CCGGGAAGAA 

+3 K S R V V D L N L L T E E V R L Y S C T
121 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC 

+3PRNF SVS IREE LKR TDT IFW 
181 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC 

+3 P G C L L V K R C G G N C A C C L H N C
241 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA 

+3 N E C Q C V P S K V T K K Y H E V L Q L
301 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA

+3 R P K T G V R G L H K S L T D V A L E H
361 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC

+3 H E E C D C V C R G S T G G
421 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAATGAATT CGAAGCTTGA

481 TCCGGCTGCT AACAAAGCCC
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Expression of PDGF domain in E.coli

1 2 3 
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DNA and polypeptide sequence used for E.coli expression of the CUB-like domain 

+2 M A M D I G I N S D P E S H H H H H H

1 GGCGATGGCC ATGGATATCG GAATTAATTC GGATCCGGAG TCTCACCATC ACCACCATCA

+2 E S N L S S K F Q F S S N K E Q N G V Q
61 TGAATCCAAC CTGAGTAGTA AATTCCAGTT TTCCAGCAAC AAGGAACAGA ACGGAGTACA

+2 D P Q H E R I I T V S T N G S I H S P R
121 AGATCCTCAG CATGAGAGAA TTATTACTGT GTCTACTAAT GGAAGTATTC ACAGCCCAAG

+ 2 FPH TYPR NTV L VW R L V A VEE 
181 GTTTCCTCAT ACTTATCCAA GAAATACGGT CTTGGTATGG AGATTAGTAG CAGTAGAGGA 

+2 NVW IQLT FDE RFG LEDP EDD 
241 AAATGTATGG ATACAACTTA CGTTTGATGA AAGATTTGGG CTTGAAGACC CAGAAGATGA 

+ 2 ICK YDFV EVE EPS DGTI LGR 
301 CATATGCAAG TATGATTTTG TAGAAGTTGA GGAACCCAGT GATGGAACTA TATTAGGGCG

+2 W C G S G T V P G K Q I S K G N Q I R I
361 CTGGTGTGGT TCTGGTACTG TACCAGGAAA ACAGATTTCT AAAGGAAATC AAATTAGGAT

+2 R F V S D E Y F P S E P G F C I H Y N I
421 AAGATTTGTA TCTGATGAAT ATTTTCCTTC TGAACCAGGG TTCTGCATCC ACTACAACAT

+2 V M P Q F T E A V
481 TGTCATGCCA CAATTCACAG AAGCTGTGTA GTCGAGCTCC GTCGACAAGC TTGCGGCCGC

541 ACTCGAGCAC
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Expression of the CUB domain in E.coli 



M CUB 
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Partial intron/exon structure of the VEGF-X gene 
(A) - Genomic DNA sequences of 2 exons determined by sequencing

agtcctcag^acccSc^ 
t ^ at atcttgaa^cag!ma^^tcgca^ 

^^^^^^^^^^^^ 

ATAGCCG<*TCACC^GCA^^Sc™ 
CCTTAATCTCAGTTCTrSGlJ^^ 

<*:taatgtcag T acac^aaI^™ 



CTAAAATCGTATAAAATCTGG A1 U 1 i 1 1 U U 1 j j iT 



^CCTGATTCCGTTGCCTTGGCTTAACTCTAAAGCTCCATGTCCTGGGC 



AATTTCTGCCATTTAGAAGAAGAGAA^TACA^T^AT^nTX^ 
TCGATAAGTCAGTTTATT^^S™^ 

aatatatctatttttaccIaaIIS^SSS^^ "^^^"ataactgttggcttttctaatcttotta 

ACXCAATTtrTTATAGCCAGA^J^^ 

^AGAS^^^^St^^ 

TCATATCTTCCA^cSSSS*^^ 

tccttttttg^aaSSg^^ 

TGCTffTGCAGTAGGA^CACATC^WTTTATTG'rGAT^^ 

TcxsATATTTTTATcrrAavGA^ 

TGCTTCTAAAATGCTTAATATcS^ 
GG CT ATTTTC«X»GAA*££a5^ 

^^AGAAAATTatscg^cgcgcgcccaagatccaccccccggaccctgagaaaacgaaagataaa 
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(B) - Location of splice sites within the cDNA sequence 

1 GAATTCCCCC 27TTGTTTAA ACCTTCGGAA CTGGTTCAGG TCCAGGTTTT GCTTTGATCC 

61 TTTTCAAAAA CTCGACACAC AGAAGAGGGC TCTAGGAAAA AGTTTTCGAT CGGATTATCT 

121 GGAAACTACC CTGCGATTCT CTGCTGCCAG AGCAGGCTCG GCGCTTCCAC CCCAGTGCAC

181 CCTTCCCCTG GCGGTGGTGA AAGAGACTCG CCACTCCCTC CTTCCAAACT CCCCGCCGTG



+3 M S
241 AOTGAGCTCT CACCCCAGTC AGCCAAATGA GCCTCTTCGG 

O A L A G Q R Q G T Q A E S N L S S K F Q
301 CCCTGGCCGG CCAGAGACAG GGGACTCAGG CGGAATCCAA CCTGAGTAGT AAATTCCAGT

+3 F S S N K E Q N G V Q D P Q H E R I I T
361 TTTCCAGCAA CAAGGAACAG AACGGAGTAC AAGATCCTCA GCATGAGAGA ATTATTACTG

+3 V S T N G S I H S P R F P H T Y P R N T
421 TGTCTACTAA TGGAAGTATT CACAGCCCAA GGTTTCCTCA TACTTATCCA AGAAATACGG

+3 V L V W R L V A V E E N V W I Q L T F D
481 TCTTGGTATG GAGATTAGTA GCAGTAGAGG AAAATGTATG GATACAACTT ACGTTTGATG

+3 E R F G L E D P E D D I C K Y D F V E V
541 AAAGATTTGG GCTTGAAGAC CCAGAAGATG ACATATGCAA GTATGATTTT GTAGAAGTTG

+3 E E P S D G T I L G R W C G S G T V P G
601 AGGAACCCAG TGATGGAACT ATATTAGGGC GCTGGTGTGG TTCTGGTACT GTACCAGGAA

+3 K Q I S K G N Q I R I R F V S D E Y F P
661 AACAGATTTC TAAAGGAAAT CAAATTAGGA TAAGATTTGT ATCTGATGAA TATTTTCCTT

+3 S E P G F C I H Y N I V M P Q F T E A V
721 CTGAACCAGG GTTCTGCATC CACTACAACA TTGTCATGCC ACAATTCACA GAAGCTGTGA

+3 S P S V L P P S A L P L D L L N N A I T
781 GTCCTTCAGT GCTACCCCCT TCAGCTTTGC CACTGGACCT GCTTAATAAT GCTATAACTG

+3 A F S T L E D L I R Y L E P E R W Q L D
841 CCTTTAGTAC CTTGGAAGAC CTTATTCGAT ATCTTGAACC AGAGAGATGG CAGTTGGACT

+3 L E D L YRP TWQL LGK AFV F G R 
901 TAGAAGATCT ATATAGGCCA ACTTGGCAAC TTCTTGGCAA OOLTIUUII TTTGGAAGAA

+3 K S R V V D L N L L T E E V R L Y S C T
961 AATCCAGAGT GGTGGATCTG AACCTTCTAA CAGAGGAGGT AAGATTATAC AGCTGCACAC

+3 P R N F S V S I R E E L K R T D T I F W
1021 CTCGTAACTT CTCAGTGTCC ATAAGGGAAG AACTAAAGAG AACCGATACC ATTTTCTGGC

+3 P G C L L V K R C G G N C A C C L H N C
1081 CAGGTTGTCT CCTGGTTAAA CGCTGTGGTG GGAACTGTGC CTGTTGTCTC CACAATTGCA

+3 N E C Q C V P S K V T K K Y H E V L Q L
1141 ATGAATGTCA ATGTGTCCCA AGCAAAGTTA CTAAAAAATA CCACGAGGTC CTTCAGTTGA
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+3 R P K T G V R G L H K S L T D V A L E H
1201 GACCAAAGAC CGGTGTCAGG GGATTGCACA AATCACTCAC CGACGTGGCC CTGGAGCACC

+3 H E E C D C V C R G S T G G
1261 ATGAGGAGTG TGACTGTGTG TGCAGAGGGA GCACAGGAGG ATAGCCGCAT CACCACCAGC

1321 AGCTCTTGCC CAGAGCTGTG CAGTGCAGTG GCTGATTCTA TTAGAGAACG TATGCGTTAT 

1381 CTCCATCCTT AATCTCAGTT GTTTGCTTCA AGGACCTTTC ATCTTCAGGA TTTACAGTGC 

1441 ATTCTGAAAG AGGAGACATC AAACAGAATT AGGAGTTCTG CAACAGCTCT TTTGAGAGGA

1501 GGCCTAAAGG ACAGGAGAAA AGGTCTTCAA TCGTGGAAAG AAAATTAAAT GTTGTATTAA

1561 ATAGATCACC AGCTAGTTTC AGAGTTACCA TGTACGTATT CCACTAGCTG GGTTCTGTAT

1621 TTCAGTTCTT TCGATACGGC TTAGGGTAAT GTCAGTACAG GAAAAAAACT GTGCAAGTGA 

1681 GCACCTGATT CCGTTGCCTT GCTTAACTCT AAAGCTCCAT GTCCTGGGCC TAAAATCGTA 

1741 TAAAATCTGG ATTTTTTTTT TTTTTTTTTG CTCATATTCA CATATGTAAA CCAGAACATT

1801 CTATGTACTA CAAACCTGGT TTTTAAAAAG GAACTATGTT GCTATGAATT AAACTTGTGT 

1861 CATGCTGATA GGACAGACTG GATTTTTCAT ATTTCTTATT AAAATTTCTG CCATTTAGAA 

1921 GAAGAGAACT ACATTCATGG TTTGGAAGAG ATAAACCTGA AAAGAAGAGT GGCCTTATCT 

1981 TCACTTTATC GATAAGTCAG TTTATTTGTT TCATTGTGTA CATTTTTATA TTCTCCTTTT

2041 GACATTATAA CTGTTGGCTT TTCTAATCTT GTTAAATATA TCTATTTTTA CCAAAGGTAT

2101 TTAATATTCT TTTTTATGAC AACTTAGArC AACTATTT7T AGCTTGGTAA ATTTTTCTAA 

2161 ACACAATTGT TATAGCCAGA GGAACAAAGA TGATATAAAA TATTGTTGCT CTGACAAAAA 

2221 TACATGTATT TCATTCTCGT ATGGTGCTAG AGTTAGATTA ATCTGCATTT TAAAAAACTG 

2281 AATTGGAATA GAATTGGTAA GTTGCAAAGA CTTTTTGAAA ATAATTAAAT TATCATATCT 

2341 TCCATTCCTG TTATTGGAGA TGAAAATAAA AAGCAACTTA TGAAAGTAGA CATTCAGATC 

2401 CAGCCATTAC TAACCTATTC CTTTTTTGGG GAAATCTGAG CCTAGCTCAG AAAAACATAA 

2461 AGCACCTTGA AAAAGACTTG GCAGCTTCCT GATAAAGCGT GCTGTGCTGT GCAGTAGGAA

2521 CACATCCTAT TTATTGTGAT GTTGTGGTTT TATTATCTTA AACTCTCTTC CATACACTTG

2581 TATAAATACA TGGATATTTT TATGTACAGA AGTATGTCTC TTAACCAGTT CACTTATTGT

2641 ACCTGGAAGG GCGAATTCTG CAGATATC 
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INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCT Rule 13bis)



A. The indications made below relate to the deposited microorganism or other biological material referred to in the description



on page      , line 15-16



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet □



Name of depositary institution 

BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS (BCCM)™ 
LABORATORIUM VOOR MOLECULAIRE BIOLOGIE - PLASMIDENCOLLECTIE (LMBP) 



Address of depositary institution (including postal code and country)

Universiteit Gent 
K.L. Ledeganckstraat 35 
B-9000 Gent, Belgium 



Date of deposit 

20 December 1999 (20.12.99) 



Accession Number 
LMBP 3991 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



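E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession Number of Deposit")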



For receiving Office use only



□ This sheet was received with the international application



Authorized officer 






For International Bureau use only
□ This sheet was received by the International Bureau on:



19 APRIL 2000






Authorized officer 



Ellen Moyse 
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BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS - BCCM™
LMBP-COLLECTION

International Form BCCM™/LMBP/BP/4/99-23



To : Name of the depositor : Janssen Pharmaceutica N.V.



Address Turnhoutseweg 30 

B-2340 Beerse 
Belgium 



I. Identification of the microorganism:

I.1 Identification reference given by the depositor:



VEGF-X CUB PET22b 



I.2 Accession number given by the International Depositary Authority:



LMBP 3991 



BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS - BCCM™    PCT/US99/30503

Page 1 of Form BCCM™/LMBP/BP/9/99-23 Viability statement 

Budapest Treaty on the International Recognition of the Deposit of Microorganisms for

the Purposes of Patent Procedure 

Viability statement issued pursuant to Rule 10.2 by the International Depositary 
Authority BCCM™/LMBP identified on the following page 

International Form BCCM™/LMBP/BP/9/99-23 



To : Party to whom the viability statement is issued: 
Name : Dr Filip De Corte

Address : Janssen Pharmaceutica N.V. 

Turnhoutseweg 30 
B-2340 Beerse 
Belgium 

I. Depositor 

I.1 Name : Janssen Pharmaceutica N.V.

I.2 Address : Turnhoutseweg 30

B-2340 Beerse 
Belgium 



II. Identification of the microorganism:

II.1 Accession number given by the International Depositary Authority:



LMBP 3991 



II.2 Date of the original deposit (or where a new deposit or a transfer has been
made, the most recent relevant date) : December 20, 1999 

III. Viability statement.

The viability of the microorganism identified under II above was tested on
: January 11, 2000

(Give date. In the cases referred to in Rule 10.2(a)(ii) and (iii), refer to the most recent
viability test). 

On that date, the said microorganism was: (mark the applicable box with a cross) 
viable 

□ no longer viable 



PCT/US99/30503 



BELGIAN COORDINATED COLLECTIONS OF MICROORGANISMS - BCCM™ 
LMBP-COLLECTION 

Page 2 of Form BCCM™/LMBP/BP/4/99-23 Receipt in the case of an original deposit 



II. Scientific description and/or proposed taxonomic designation 

The microorganism identified under I above was accompanied by: 

(mark with a cross the applicable box(es)) 

- a scientific description yes ☒ no □

- a proposed taxonomic designation yes □ no ☒

III. Receipt and acceptance 

This International Depositary Authority accepts the microorganism identified under I 
above, which was received by it on (date of original deposit) : December 20, 1999 



IV. International Depositary Authority 

Belgian Coordinated Collections of Microorganisms (BCCM™) 

Laboratorium voor Moleculaire Biologie - Plasmidencollectie (LMBP) 

Universiteit Gent 

K.L. Ledeganckstraat 35 

B-9000 Gent, Belgium 

Signature(s) of person(s) having the power to represent the International Depositary 
Authority or of authorized official(s): 






Date : January 12, 2000 



Martine Vanhoucke 
BCCM/LMBP curator 



